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INVASION ASSOCIATED GENES FROM NEISSERIA 
5 MENINGITIDIS SEROGROUP B 

FIELD OF THE INVENTION 
The invention relates to new genes isolated from Neisseria meningitidis. 
Isolated nucleic acids, probes, expression cassettes, polypeptides, antibodies, 
immunogenic compositions, antisense nucleic acids, amplification mixtures and new 
10 invasion deficient strains of Neisseria meningitidis. The invention also relates to methods 
of detecting Neisseria meningitidis and Neisseria meningitidis nucleic acids, and to 
methods of inhibiting the invasion of mammalian cells by Neisseria meningitidis. 

BACKGROUND OF THE INVENTION 
Neisseria meningitidis, a Gram-negative encapsulated diplococcus, is an 
15 obligate human pathogen and the causative agent of meningococcal .meningitis, one of the 
most devastating forms of meningitis. These bacteria are isolated from humans 
worldwide and can cause sporadic and epidemic disease. Person-to-person transfer of N. 
meningitidis occurs mainly via the airborne route, and is particularly a problem in places 
where people are in close quarters, such as prisons, military camps, school class rooms, 
20 and day care centers. At any one time, between 2 and 10% of individuals in the 

population carry this organism asymptomatically (Greenfield, S., et al. (1971), J. Infec. 
Dis., 123:67-73; Moore, P.S., et al. (November 1994), Scientific American, p38-45; 
Romero, J.D., et al (1994), Clinical Microbiology Review, 7:559-575). With such a 
high carrier rate, the threat or potential for outbreaks or epidemics is always present. 
25 Although significant advances have been made in the area of the pathogenesis of the 
organism, there is much to be learned about the genetics and cell biology of the host- 
parasite interaction. 

Understanding the mechanism(s) of attachment and invasion is one of the 
most important aspects in N. meningitidis disease. In order to cause disease, 
30 meningococci must survive and colonize the mucosa of the nasopharynx, pass through 
these tissues into the bloodstream, replicate to large numbers in the blood, cross the 
blood-brain barrier and multiply in the cerebrospinal fluid (CFS) where they cause 
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inflammation of the meninges. Various models have been used in order to mimic the 
events that take place during infection in humans. Mouse models (Miller, CP. (1933), 
Science, 78:340-341; Holbein, B.E. (1981), Can. J. Microbiol, 27:738-741; Salit, I.E. 
(1984), Can. J. Microbiol., 30:1022-1029), human nasopharyngeal organ culture 
5 (Stephens, D.S., et al (1991), Rev Infect Dis., 13:22-33), chick embryo (Buddingh, 

G.J., et al. (1987), Science, 86:20-21; Pine, L., et al., Micrbiol. Lett., 130:37-44), and 
tissue culture monolayer and bilayer systems (Birkness, K.A., et al (1995), Infect. 
Immun., 63:402-409) represent some of the models commonly used to study virulence of 
N. meningitidis. 

10 The organ culture system has been used successfully to assess the 

attachment and invasion properties of various N. meningitidis strains (Salit, I.E. (1984), 
Can. J. Microbiol, 30:1022-1029). 

Designated by serogroup, serological classification of N. meningitidis is 
based on the capsular polysaccharide composition of the particular strain. Among the 

15 meningococci there are at least thirteen different serogroups: A, B, C, 29-E, H, I, K, L, 
W135, X, Y and Z. Of these serogroups, A, B and C comprise over 90% of the strains 
isolated from patients afflicted with meningococcal meningitis (Poolman, J.T., et al 
(1995), Infectious Agents and Disease, 4: 13-28). The nature of the capsule in serogroups 
A and C has led to the development of useful vaccines against these serogroups. 

20 However, the serogroup B capsular polysaccharide does not induce protection in humans. 
Many laboratories around the world are concentrating their efforts on the study and 
characterization of epitopes from various membrane and other extracellular factors for 
use as vaccine candidates. Some of the most common non -capsule factors in such studies 
include a number of outer membrane proteins (OMP) such as class 1 (Por A, a cation 

25 specific porin), class 2 or 3 (Por B, an anion specific protein) and to a lesser extent class 
4 and class 5 OMPs (Rmp, and Opc and Opa opacity associated proteins, respectively). 
While class 5 Opc and Opa OMPs have been shown to play roles in the invasion of 
epithelial cells (Virji, M., et al. (1992), Mol. Microbiol, 6:2785-95) due to their 
antigenic and phase variability (Aho, E.L., et al (1991), Mol Microbiol, 5:1429-37), 

30 they are not considered to be good vaccine candidates. 

Class 1 OMPs appear to be good candidates for vaccine studies since these 
proteins have been shown to induce protective immunity. Evaluation of various non- 
capsular antigens as potential vaccine candidates in in vitro bactericidal assays and an 
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infant rat model revealed that class 1 OMP had the highest protective capacity compared 
to factors such as LPS and class 2/3 OMPs (Saukkonen, K., et al (1989), Vaccine, 
7:325-328). However, preliminary data from vaccine trial studies suggests that these 
factors do not elicit a complete immune response, especially in children (Romero, J.D., 
5 et al. (1994), Clinical Microbiology Review, 7:559-575; Poolman, J.T., et al. (1995), 
Infectious Agents and Disease, 4: 13-28). The development of fusion or hybrid genes 
containing epitopes from class 1 OMP show great promise as vaccine candidates (Van 
der Ley, P., et al (1992), Infect. Immun., 60:3156-3161; Van der Ley, P., et al. 
(1993), Infect. Immun., 61:4217-4224). However, these hybrids do not elicit protection 

10 in infants, and the immunity induced is type specific and very short-lived (Poolman, 
J.T., et al. (1995), Infectious Agents and Disease, 4:13-28). For these and other 
reasons, it is of importance to identify alternative serogroup B vaccine antigens. Initial 
attachment and invasion by the pathogen is critical to the disease process. If mucosal 
immunity can be derived against these bacterial factors, the disease process and the 

15 carrier state can be prevented. The present invention provides these and other features. 

SUMMARY OF THE INVENTION 
The invention provides nucleic acids and encoded polypeptides associated 
with invasion of Neisseria meningitidis. The polypeptides are used as diagnostic 
reagents, as immunogenic reagents, and as components of vaccines. The nucleic acids 

20 are used as diagnostic reagents, as components of vectors and vaccines, and to encode 
the polypeptides of the invention. The invention also provides strains of Neisseria 
meningitidis which have an invasion deficient phenotype. 

In one embodiment, the invention provides isolated nucleic acids encoding 
the polypeptides of the invention, including ORF 1 (SEQ ID NO:2), ORF 2 (ORF2a 

25 (SEQ ID NO:4) and ORF2b (SEQ ID NO: 5), two separate embodiments depending on 
alternate start sites for the ORF2 polypeptide), ORF 3 (SEQ ID NO:7) and, 
conservatively modified variations of each of the polypeptides. Exemplar nucleic acids 
include Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), and Seq 3 (SEQ ID NO:7) (see, 
Figures 5, 6, and 7 respectively). Other nucleic acids encoding the same polypeptides 

30 include those with silent codon substitutions relative to Seq 1 (SEQ ID NO:l), Seq 2 

(SEQ ID NO: 3), or Seq 3 (SEQ ID NO: 6), as well as conservatively modified variations 
thereof. 

Isolated nucleic acids which hybridize under stringent conditions to the 
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exemplar nucleic acids Seq 1 (SEQ ID NO: 1), Seq 2 (SEQ ID NO:3), or Seq 3 (SEQ ID 
NO: 6) are also provided. For example, a complementary nucleic acid to a sequence 
provided by Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), or Seq 3 (SEQ ID NO:6) 
hybridizes to Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), or Seq 3 (SEQ ID NO:6), 
respectively. Nucleic acids which include substantial subsequences complementary to 
Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), or Seq 3 (SEQ ID NO:6) also hybridize 
to Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), or Seq 3 (SEQ ID NO:6), respectively. 

Isolated nucleic acids which hybridize under stringent conditions to Seq 4 
(SEQ ID NO:8) are provided. Seq 4 (SEQ ID NO:8) is a genomic sequence which 
encodes Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), and Seq 3 (SEQ ID NO:6). 
Thus, complementary nucleic acids to sequences provided by Seq 1 (SEQ ID NO:l), Seq 

2 (SEQ ID NO:3), Seq 3 (SEQ ID NO:6), or Seq 4 (SEQ ID NO:8) all hybridize to Seq 
4 (SEQ ID NO: 8) under stringent conditions. Similarly, nucleic acids which include 
substantial subsequences of Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), Seq 3 (SEQ 
ID NO:6) or Seq 4 (SEQ ID NO:8) also hybridize to Seq 4 (SEQ ID NO:8). The 
isolated nucleic acids are optionally vector nucleic acids which comprise a transcription 
cassette. The transcription cassette optionally encodes a polypeptide. Typically, the 
portion of the transcription cassette which encodes the polypeptide hybridizes to Seq 4 
(SEQ ID NO: 8) under stringent conditions. Upon transduction of the transcription 
cassette into a cell, an mRNA which hybridizes to Seq 4 (SEQ ID NO:8) under stringent 
conditions is produced. The mRNA is translated in the cell into a polypeptide, such as 
the ORF 1 (SEQ ID NO:2), ORF 2a (SEQ ID NO:4), ORF 2b (SEQ ID NO:5) or ORF 

3 (SEQ ID NO: 7) polypeptides. 

Polypeptides encoded by nucleic acids which hybridize under stringent 
conditions to Seq 4 (SEQ ID NO:8), including Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID 
NO:3), Seq 3 (SEQ ID NO:7) are provided herein. Exemplar polypeptides include ORF 
1 (SEQ ID NO: 1), ORF 2a (SEQ ID NO:4), ORF 2b (SEQ ID NO:5) or ORF 3 (SEQ 
ID NO:6). 

Full length polypeptides of the invention, or antigenic epitopes derived 
from the full length polypeptides of the invention are optionally present in immunogenic 
compositions. The antigenic epitopes are optionally incorporated into fusion proteins, 
which optionally include antigenic epitopes from related or unrelated proteins. The 
antigenic epitopes are optionally expressed on the surface of antigenic viral vectors. 
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The immunogenic compositions optionally comprise components to 
enhance immunogenicity, such as an adjuvant. The compositions optionally include 
pharmaceutical^ acceptable excipients. When administered to a mammal, the 
immunogenic compositions optionally provide an immune response against antigenic 
epitopes which are included in the immunogenic compositions. In one preferred 
embodiment, administration of the immunogenic composition of the invention to a 
mammal inhibits invasion of the cells of the mammal by Neisseria meningitidis. 

Antibodies which specifically bind to the polypeptides of the invention are 
provided. In a preferred embodiment, the antibodies bind to a polypeptide such as ORF 
1 (SEQ ID NO:2), ORF 2a (SEQ ID NO:4), ORF 2b (SEQ ID NO:5), or ORF 3 (SEQ 
ID NO:7), without binding to the E coli FtsZ protein, or to the E coli UNK protein. 
Typically, the antibodies specifically bind to the ORF 1 (SEQ ID NO:2), ORF 2a (SEQ 
ID NO:4), ORF 2b (SEQ ID NO:5), or ORF 3 (SEQ ID NO:7) proteins. 

The invention provides isolated Neisseria meningitidis diplococcus. The 
diplococcus has a reduced ability to invade tissue culture epithelial cells in vitro as 
compared to a wild-type Neisseria meningitidis diplococcus and the genome of the 
isolated Neisseria meningitidis diplococcus has a modification in the region of the 
genome corresponding to Seq 4 (SEQ ID NO:8). In one embodiment, the isolated 
Neisseria meningitidis diplococcus comprises a transposon insertion in the region of the 
genome corresponding to Seq 4 (SEQ ID NO:8). 

The invention provides a variety of assays for detecting Neisseria 
meningitidis, including PCR assays, northern blots, Southern blots, western blots and 
ELISA assays. For example, the invention provides PCR reaction mixtures using 
template nucleic acids which hybridize to Seq 4 (SEQ ID NO:8) under stringent 
conditions. The mixture has a primer pair which hybridizes to the template nucleic acid, 
wherein the primers, when hybridized to the template, serve as initiation sites for primer 
extension by a thermostable polymerase such as taq or vent DNA polymerase. The 
products of PCR amplification are detected by detecting the amplified nucleic acid 
products (amplicons) of the PCR reaction. 

In several methods relying on nucleic acid hybridization, the detection of a 
Neisseria meningitidis nucleic acid in a biological sample is performed by contacting a 
probe nucleic acid to the sample and detecting binding of the nucleic acid to the Neisseria 
meningitidis nucleic acid. The probe hybridizes to Seq 4 (SEQ ID NO:8), or the 
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complement thereof. Many assay formats are appropriate, including northern and 
Southern blotting. 

In one embodiment, the invention provides methods of inhibiting the 
invasion of a mammalian cell by Neisseria meningitidis by expressing an anti-sense RNA 
molecule in the mammalian cell. The antisense RNA molecule hybridizes to a nucleic 
acid which hybridizes under stringent conditions to a nucleic acid encoded by Seq 1 
(SEQ ID NO: 1), Seq 2 (SEQ ID NO:3), Seq 3 (SEQ ID NO:7), or Seq 4 (SEQ ID 
NO: 8). Such anti sense molecules optionally comprise catalytic RNA ribonuclease 
domains, such as those derived from a ribozyme. 

BRIEF DESCRIPTION OF THE DRAWING 
Figure 1 is a schematic of the region from Neisseria meningitidis 
surrounding the Tn916 transposon from VVV6. 

Figure 2 is a graph of the attachment-invasion assay performed on the 
HEC-l-B cell line. 

Figure 3 is a graph of the attachment-invasion assay performed on the 
HEC-l-B cell line with VVV6 and related recombinant Neisseria meningitidis. 

Figure 4 shows the sequence of Seq 4 (SEQ ID NO: 8), with ribosome 
binding sites (RBS), start sites and stop sites for ORF 1 (SEQ ID NO:7), ORF 2a (SEQ 
ID NO:4), ORF 2b (SEQ ID NO:5),and ORF 3 (SEQ ID NO:2). 

Figure 5 shows the sequence of Seq 1 (SEQ ID NO:l) {see, the nucleic 
acid sequence of the open reading frame) and the corresponding amino acid sequence 
ORF 1 (SEQ ID NO:2). 

Figure 6 shows the sequence of Seq 2 (SEQ ID NO:3) {see, the nucleic 
acid sequence of the open reading frame) and the corresponding amino acid sequences of 
ORF 2a (SEQ ID NO:4) and ORF 2b (SEQ ID NO:5). 

Figure 7(a) shows the sequence of Seq 3 (SEQ ID NO:7) {see, the nucleic 
acid sequence of the open reading frame) and the corresponding amino acid sequence 
ORF 3 (SEQ ID NO:7). Figure 7(b) provides an alternate embodiment of ORF 3. 

Figure 8 shows the alignment of nucleic acid sequences encoding ORF 1 
(Seq 1; SEQ ID NO:l), ORF 2 (Seq 2; SEQ ID NO:3), and ORF 3 (Seq 3; SEQ ID 
NO:6) with Seq 4 (PATENT. SEQ; SEQ ID NO:8). 
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DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the art to 
which this invention belongs. Singleton et al. (1994) Dictionary of Microbiology and 
Molecular Biology, second edition, John Wiley and Sons (New York); Walker (ed) 
(1988) The Cambridge Dictionary of Science and Technology, The press syndicate of the 
University of Cambridge, NY; and Hale and Marham (1991) The Harper Collins 
Dictionary of Biology, Harper, Perennial, NY provide one of skill with a general 
dictionary of many of the terms used in this invention. Although any methods and 
materials similar or equivalent to those described herein can be used in the practice or 
testing of the present invention, certain preferred methods and materials are described. 
For purposes of the present invention, the following terms are defined below. 

The terms "isolated" or "biologically pure" refer to material which is 
substantially or essentially free from components which normally accompany it as found 
in its native state. 

The term "nucleic acid" refers to a deoxyribonucleotide or ribonucleotide 
polymer in either single- or double-stranded form, and unless otherwise limited, 
encompasses known analogues of natural nucleotides that hybridize to nucleic acids in 
manner similar to naturally occurring nucleotides. Unless otherwise indicated, a 
particular nucleic acid sequence optionally includes the complementary sequence thereof. 

The term "subsequence" in the context of a particular nucleic acid 
sequence refers to a region of the nucleic acid equal to or smaller than the specified 
nucleic acid. Thus, for example, a viral inhibitor nucleic acid subsequence is a 
subsequence of a vector nucleic acid, because, in addition to encoding the viral inhibitor, 
the vector nucleic acid optionally encodes other components such as a promoter, a 
packaging site, chromosome integration sequences and the like. 

Two single-stranded nucleic acids "hybridize" when they form a double- 
stranded duplex. The region of double-strandedness can include the full-length of one or 
both of the single-stranded nucleic acids, or all of one single stranded nucleic acid and a 
subsequence of the other single stranded nucleic acid, or the region of double- 
strandedness can include a subsequence of each nucleic acid. An overview to the 
hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in 
Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes part I 
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chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe 
assays", Elsevier, New York. 

"Stringent hybridization wash conditions" in the context of nucleic acid 
hybridization experiments such as Southern and northern hybridizations are sequence 
dependent, and are different under different environmental parameters. An extensive 
guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory 
Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid 
Probes, part I, chapter 2, "Overview of principles of hybridization and the strategy of 
nucleic acid probe assays", Elsevier, New York. Generally, highly stringent wash 
conditions are selected to be about 5° C lower than the thermal melting point (TJ for 
the specific sequence at a defined ionic strength and pH. The T m is the temperature 
(under defined ionic strength and pH) at which 50% of the target sequence hybridizes to 
a perfectly matched probe. Very stringent conditions are selected to be equal to the T m 
point for a particular probe. Nucleic acids which do not hybridize to each other under 
stringent conditions are still substantially identical if the polypeptides which they encode 
are substantially identical. This occurs, e.g. , when a copy of a nucleic acid is created 
using the maximum codon degeneracy permitted by the genetic code. 

The term "identical" in the context of two nucleic acid or polypeptide 
sequences refers to the residues in the two sequences which are the same when aligned 
for maximum correspondence. A nucleic acid is "substantially identical to a reference 
nucleic acid when it is at least about 70% identical, preferably at least about 80% 
identical, and optionally about 90% identical or more. When percentage of sequence 
identity is used in reference to proteins or peptides it is recognized that residue positions 
which are not identical often differ by conservative amino acid substitutions, where 
amino acids residues are substituted for other amino acid residues with similar chemical 
properties (e.g., charge or hydrophobicity) and therefore do not change the functional 
properties of the molecule. Where sequences differ in conservative substitutions, the 
percent sequence identity may be adjusted upwards to correct for the conservative nature 
of the substitution. Means for making this adjustment are well known to those of skill in 
the art. Typically this involves scoring a conservative substitution as a partial rather than 
a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, 
where an identical amino acid is given a score of 1 and a non-conservative substitution is 
given a score of zero, a conservative substitution is given a score between zero and 1. 
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The scoring of conservative substitutions is calculated, e.g., according to known 
algorithm. See, e.g., Meyers and Miller, Computer Applic. Biol. Sci., 4: 11-17 (1988); 
Smith and Waterman (1981) Adv. Appl. Math. 2: 482; Needleman and Wunsch (1970) /. 
MoL Biol. 48: 443; Pearson and Lipman (1988) Proc. Natl. Acad. ScL USA 85: 2444; 
5 Higgins and Sharp (1988) Gene, 73: 237-244 and Higgins and Sharp (1989) CABIOS 5: 
151-153; Corpet, et al (1988) Nucleic Acids Research 16, 10881-90; Huang, et al. 
(1992) Computer Applications in the Biosciences 8, 155-65, and Pearson, et al. (1994) 
Methods in Molecular Biology 24, 307-31. Alignment is also often performed by 
inspection and manual alignment. 
10 "Conservatively modified variations" of a particular nucleic acid sequence 

refers to those nucleic acids which encode identical or essentially identical amino acid 
sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 
number of functionally identical nucleic acids encode any given polypeptide. For 
15 instance, the codons CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino 
acid arginine. Thus, at every position where an arginine is specified by a codon, the 
codon can be altered to any of the corresponding codons described without altering the 
encoded polypeptide. Such nucleic acid variations are "silent variations," which are one 
species of "conservatively modified variations." Every nucleic acid sequence herein 
20 which encodes a polypeptide also describes every possible silent variation. One of skill 
will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the 
only codon for methionine) can be modified to yield a functionally identical molecule by 
standard techniques. Accordingly, each "silent variation" of a nucleic acid which 
encodes a polypeptide is implicit in each described sequence. Furthermore, one of skill 
25 will recognize that individual substitutions, deletions or additions which alter, add or 

delete a single amino acid or a small percentage of amino acids (typically less than 5%, 
more typically less than 1 %) in an encoded sequence are "conservatively modified 
variations" where the alterations result in the substitution of an amino acid with a 
chemically similar amino acid. Conservative substitution tables providing functionally 
30 similar amino acids are well known in the art. The following six groups each contain 
amino acids that are conservative substitutions for one another: 

1) Alanine (A), Serine (S), Threonine (T); 

2) Aspartic acid (D), Glutamic acid (E); 
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3) Asparagine (N), Glutamine (Q); 

4) Arginine (R), Lysine (K); 

5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and 

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W). 

The term "antibody" refers to a polypeptide substantially encoded by an 
immunoglobulin gene or immunoglobulin genes, or fragments thereof. The recognized 
immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu 
constant region genes, as well as myriad immunoglobulin variable region genes. Light 
chains are classified as either kappa or lambda. Heavy chains are classified as gamma, 
mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IglvL 
IgA, IgD and IgE, respectively. 

An exemplar immunoglobulin (antibody) structural unit comprises a 
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each 
pair having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The 
N-terminus of each chain defines a variable region of about 100 to 1 10 or more amino 
acids primarily responsible for antigen recognition. The terms variable light chain (VJ 
and variable heavy chain (V^ refer to these light and heavy chains respectively. 

Antibodies exist e.g., as intact immunoglobulins or as a number of well 
characterized fragments produced by digestion with various peptidases. Thus, for 
example, pepsin digests an antibody below the disulfide linkages in the hinge region to 
produce F(ab)' 2 a dimer of Fab which itself is a light chain joined to V H -C H 1 by a 
disulfide bond. The F(ab)* 2 may be reduced under mild conditions to break the disulfide 
linkage in the hinge region thereby converting the F(ab)' 2 dimer into an Fab' monomer. 
The Fab' monomer is essentially an Fab with part of the hinge region (see, Fundamental 
Immunology, Third Edition, W.E. Paul, ed., Raven Press, N.Y. (1993), which is 
incorporated herein by reference, for a more detailed description of other antibody 
fragments). While various antibody fragments are defined in terms of the digestion of an 
intact antibody, one of skill will appreciate that such Fab' fragments may be synthesized 
de novo either chemically or by utilizing recombinant DNA methodology. Thus, the 
term antibody, as used herein, also includes antibody fragments either produced by the 
modification of whole antibodies or those synthesized de novo using recombinant DNA 
methodologies. 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
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region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding 
site (variable region) is linked to a constant region of a different or altered class, effector 
function and/or species, or an entirely different molecule which confers new properties to 
the chimeric antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, ere; or (b) 
5 the variable region, or a portion thereof, is altered, replaced or exchanged with a 
variable region having a different or altered antigen specificity. 

The term "immunoassay" is an assay that utilizes an antibody to 
specifically bind an analyte. The immunoassay is characterized by the use of specific 
binding properties of a particular antibody to isolate, target, and/or quantify the analyte. 
10 An "anti-ORF" antibody is an antibody or antibody fragment that 

specifically binds a polypeptide encoded by the Neisseria meningitidis ORFs, described 
herein. 

An "expression vector" includes a recombinant expression cassette which 
includes a nucleic acid which encodes a polypeptide which can be transcribed and 

15 translated by a cell. A "recombinant expression cassette'* is a nucleic acid construct, 

generated recornbinantly or synthetically, with a series of specified nucleic acid elements 
which permit transcription of a particular nucleic acid in a target cell. The expression 
vector can be part of a plasmid, virus, or nucleic acid fragment. Typically, the 
recombinant expression cassette portion of the expression vector includes a nucleic acid 

20 to be transcribed, and a promoter. In some embodiments, the expression cassette also 
includes, e.g., an origin of replication, and/or chromosome integration elements. A 
"promoter" is an array of nucleic acid control sequences which direct transcription of a 
nucleic acid. As used herein, a promoter includes necessary nucleic acid sequences near 
the start site of transcription, such as, in the case of a polymerase II type promoter, a 

25 TATA element. The promoter also includes distal enhancer or repressor elements which 
can be located as much as several thousand base pairs from the start site of transcription. 
A "constitutive" promoter is a promoter which is active under most environmental 
conditions and states of development or cell differentiation. An "inducible" promoter 
responds to an extracellular stimulus. The term "operably linked" refers to functional 

30 linkage between a nucleic acid expression control sequence (such as a promoter, or array 
of transcription factor binding sites) and a second nucleic acid sequence, wherein the 
expression control sequence directs transcription of the nucleic acid corresponding to the 
second sequence. 
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The term "recombinant" when used with reference to a cell indicates that 
the cell replicates or expresses a nucleic acid, or expresses a peptide or protein encoded 
by a nucleic acid whose origin is exogenous to the cell. Recombinant cells can express 
genes that are not found within the native (non-recombinant) form of the cell. 
5 Recombinant cells can also express genes found in the native form of the cell wherein the 
genes are re-introduced into the cell by artificial means, for example under the control of 
a heterologous promoter. 

An "immunogenic composition" is a composition which elicits the 
production of an antibody which binds a component of the composition when 

10 administered to a mammal, or which elicits the production of a cell-mediated immune 
response against a component of the composition. 

An "antigenic epitope" in the context of a polypeptide is a polypeptide 
subsequence which, when presented as an immunogen, or as a portion of an immunogen 
(e.g., with a carrier protein or adjuvant, or on the surface of a viral vector), elicits an 

15 antibody which specifically binds to the full length polypeptide. 

DETAILED DESCRIPTION OF THE INVENTION 
Using several new tools and techniques, the identification of bacterial 
gene(s) which are involved in the process of cell adhesion and invasion are described. A 
TnP/6-mutant library of N. meningitidis, serogroup B, strain NMB (Kathariou, S., et al. 

20 MoL Microbiol., 4:729-735), was examined for the lost ability to attach or invade tissue 
culture epithelial cells (HEC1-B). Several hundred mutants were screened, and one 
strain, VVV6, showed a significant > 10-fold decrease in its ability to associate with the 
HEC1-B monolayer, compared to its parent strain, NMB. Southern hybridization, 
polymerase chain reaction, and DNA sequence analysis data revealed the presence of a 

25 single intact, Class 1 , copy of transposon Tn916. To demonstrate linkage between the 
transposon insertion site and mutant phenotype backtransformants were created via 
homologous recombination. All seven recombinants also showed an invasion-deficient 
phenotype as observed with VVV6. Nucleotide sequence analysis shows that the TnP/6 
insertion occurred between two open reading frames (ORFs). The nature or function of 

30 the products encoded by these ORFs is not known. ORF 3 (SEQ ID NO:7) shows no 
significant homology to any known gene, while ORF 2 (ORF 2a (SEQ ID NO:4); ORF 
2b (SEQ ID NO:5)) shows 60% identity to an E. coli gene with no known function. 
Adjacent to ORF 2, an open reading frame encoding ORF 1 was found. ORF 1 is the 
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Neisseria meningitidis ftz gene homologue. 

Making Neisseria meningiti dis Nucleic Acids and Polypeptides 

Several specific nucleic acids encoding Neisseria meningitidis polypeptides 
are described herein. These nucleic acids can be made using standard recombinant or 
5 synthetic techniques. Given the nucleic acids of the present invention, one of skill can 
construct a variety of clones containing functionally equivalent nucleic acids, such as 
nucleic acids which encode the same polypeptide. Cloning methodologies to accomplish 
these ends, and sequencing methods to verify the sequence of nucleic acids are well 
known in the art. Examples of appropriate cloning and sequencing techniques, and 

10 instructions sufficient to direct persons of skill through many cloning exercises are found 
in Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 Academic Press, Inc., San Diego, CA (Berger); Sambrook et al, (1989) 
Molecular Cloning - A Laboratory Manual (2nd ed.) Vol. 1-3; and Current Protocols in 
Molecular Biology, F.M. Ausubel et aL, eds., Current Protocols, a joint venture between 

15 Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1994 Supplement) 
(Ausubel). Product information from manufacturers of biological reagents and 
experimental equipment also provide information useful in known biological methods. 
Such manufacturers include the SIGMA chemical company (Saint Louis, MO), R&D 
systems (Minneapolis, MN), Pharmacia LKB Biotechnology (Piscataway, NJ), 

20 CLONTECH Laboratories, Inc. (Palo Alto, CA), Chem Genes Corp., Aldrich Chemical 
Company (Milwaukee, WI), Glen Research, Inc., GIBCO BRL Life Technologies, Inc. 
(Gaithersberg, MD), Fluka Chemica-Biochemika Analytika (Fluka Chemie AG, Buchs, 
Switzerland), Invitrogen, San Diego, CA, and Applied Biosystems (Foster City, CA), as 
well as many other commercial sources known to one of skill. 

25 The nucleic acid compositions of this invention, whether RNA, cDNA, 

genomic DNA, or a hybrid of the various combinations, are isolated from biological 
sources or synthesized in vitro. The nucleic acids of the invention are present in 
transformed or transfected cells, in transformed or transfected cell lysates, or in a 
partially purified or substantially pure form. 

30 In vitro amplification techniques suitable for amplifying sequences for use 

as molecular probes or generating nucleic acid fragments for subsequent subcloning are 
known. Examples of techniques sufficient to direct persons of skill through such in vitro 
amplification methods, including the polymerase chain reaction (PCR) the ligase chain 
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reaction (LCR), Q/?-replicase amplification and other RNA polymerase mediated 
techniques (e.g., NASBA) are found in Berger, Sambrook et al. (1989) Molecular 
Cloning - A Laboratory Manual (2nd Ed) Vol. 1-3; and Ausubel, as well as Mullis et aL, 
(1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide to Methods and Applications 
5 (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) (Innis); Arnheim & 

Levinson (October 1, 1990) C&EN 36-47; The Journal Of N1H Research (1991) 3, 81- 
94; (Kwoh et al (1989) Proc. Natl. Acad. Sci. USA 86, 1173; Guatelli et al. (1990) 
Proc. Natl. Acad. Sci. USA 87, 1874; Lomell et al. (1989) J. Clin. Chem 35, 1826; 
Landegren et ah, (1988) Science 241, 1077-1080; Van Brunt (1990) Biotechnology 8, 

10 291-294; Wu and Wallace, (1989) Gene 4 , 560; Barringer et al. (1990) Gene 89, 117, 
and Sooknanan and Malek (1995) Biotechnology 13: 563-564. Improved methods of 
cloning in vitro amplified nucleic acids are described in Wallace et aL, U.S. Pat. No. 
5,426,039. Improved methods of amplifying large nucleic acids are summarized in 
Cheng et al. (1994) Nature 369: 684-685 and the references therein. One of skill will 

15 appreciate that essentially any RNA can be converted into a double stranded DNA 
suitable for restriction digestion, PCR expansion and sequencing using reverse 
transcriptase and a polymerase. See, Ausbel, Sambrook and Berger, all supra. 

Oligonucleotides for use as probes, e.g., in in vitro Neisseria meningitidis 
nucleic acid amplification methods, or for use as nucleic acid probes to detect Neisseria 

20 meningitidis nucleic acids are typically synthesized chemically according to the solid 
phase phosphoramidite triester method described by Beaucage and Caruthers (1981), 
Tetrahedron Letts., 22(20): 1859-1862, e.g., using an automated synthesizer, e.g., as 
described in Needham-VanDevanter et al. (1984) Nucleic Acids Res., 12:6159-6168. 
Oligonucleotides can also be custom made and ordered from a variety of commercial 

25 sources known to persons of skill. Purification of oligonucleotides, where necessary, is 
typically performed by either native acrylamide gel electrophoresis or by anion-exchange 
HPLC as described in Pearson and Regnier (1983) J. Chrom. 255:137-149. The 
sequence of the synthetic oligonucleotides can be verified using the chemical degradation 
method of Maxam and Gilbert (1980) in Grossman and Moldave (eds.) Academic Press, 

30 New York, Methods in Enzymology 65:499-560. 

One of skill will recognize many ways of generating alterations in a given 
nucleic acid sequence. Such well-known methods include site-directed mutagenesis, PCR 
amplification using degenerate oligonucleotides, exposuie of cells containing the nucleic 
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acid to mutagenic agents or radiation, chemical synthesis of a desired oligonucleotide 
{e.g., in conjunction with ligation and/or cloning to generate large nucleic acids) and 
other well-known techniques. See, Giliman and Smith (1979) Gene 8:81-97; Roberts et 
al. (1987) Nature 328:731-734 and Sambrook et al. (1989) Molecular Cloning - A 
Laboratory Manual (2nd Ed) Vol. 1-3; Innis, Ausbel, Berger, Needham VanDevanter 
and Mullis {all supra). 

Polypeptides of the invention are optionally synthetically prepared in a 
wide variety of well-known ways. Polypeptides of relatively short size are typically 
synthesized in solution or on a solid support in accordance with conventional techniques. 
See, e.g., Merrifield (1963)7. Am. Chem. Soc. 85:2149-2154. Various automatic 
synthesizers and sequencers are commercially available and can be used in accordance 
with known protocols. See, e.g., Stewart and Young (1984) Solid Phase Peptide 
Synthesis, 2d. ed., Pierce Chemical Co. Polypeptides are also produced by recombinant 
expression of a nucleic acid encoding the polypeptide followed by purification using 
standard techniques. Solid phase synthesis in which the C-terminal amino acid of the 
sequence is attached to an insoluble support followed by sequential addition of the 
remaining amino acids in the sequence is the preferred method for the chemical synthesis 
of the polypeptides of this invention. Techniques for solid phase synthesis are described 
by Barany and Merrifield, Solid-Phase Peptide Synthesis; pp. 3-284 in The Peptides: 
Analysis, Synthesis, Biology. Vol. 2: Special Methods in Peptide Synthesis, Pan A. , 
Merrifield, et al. J. Am. Chem. Soc, 85: 2149-2156 (1963), and Stewart et al.. Solid 
Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., Rockford, 111. (1984). 
Cloning and Expressing Neisseria meningitidis Nucleic Arid* 

In a preferred embodiment, the polypeptides, or subsequences thereof, are 
synthesized using recombinant DNA methodology. Generally, this involves creating a 
DNA sequence that encodes the protein, placing the DNA in an expression cassette under 
the control of a particular promoter, expressing the protein in a host cell, isolating the 
expressed protein and, if required, renaturing the protein. 

Once a nucleic acid encoding a polypeptide of the invention is isolated and 
cloned, the nucleic acid is optionally expressed in a recombinantly engineered cells 
known to those of skill in the art. Examples of such cells include bacteria, yeast, plant, 
filamentous fungi, insect (especially employing baculoviral vectors), and mammalian 
cells. The recombinant nucleic acids are operably linked to appropriate control 



WO 98/17805 



PCT/US97/19424 



16 

sequences for expression in the selected host. For E. coli, example control sequences 
include the T7, tip, or lambda promoters, a ribosome binding site and preferably a 
transcription termination signal. For eukaryotic cells, the control sequences typically 
include a promoter and preferably an enhancer derived from immunoglobulin genes, 
5 SV40, cytomegalovirus, etc. , and a polyadenylation sequence, and may include splice 
donor and acceptor sequences. 

The plasmids of the invention can be transferred into the chosen host cell 
by well-known methods such as calcium chloride transformation for E. coli and calcium 
phosphate treatment or electroporation for mammalian cells. Cells transformed by the 
10 plasmids can be selected by resistance to antibiotics conferred by genes contained on the 
plasmids, such as the amp, gpt, neo and hyg genes. 

Once expressed, the recombinant polypeptides can be purified according to 
standard procedures of the art, including ammonium sulfate precipitation, affinity 
columns, column chromatography, gel electrophoresis and the like {see, generally, R. 
15 Scopes, Polypeptide Purification, Springer- Verlag, N.Y. (1982), Deutscher, Methods in 
Enzymology Vol. 182: Guide to Polypeptide Purification., Academic Press, Inc. N.Y. 
(1990)). Once purified, partially or to homogeneity as desired, the polypeptides may 
then be used {e.g., as immunogens for antibody production). 

After chemical synthesis, biological expression, or purification, the 
20 polypeptide(s) may possess a conformation substantially different than the native 

conformations of the constituent polypeptides. In this case, it is helpful to denature and 
reduce the polypeptide and then to cause the polypeptide to re-fold into the preferred 
conformation. Methods of reducing and denaturing polypeptides and inducing re-folding 
are well known to those of skill in the art (See, Debinski et ai (1993) J. Biol. Chem., 
25 268: 14065-14070; Kreitman and Pastan (1993) Bioconjug. Chem., 4: 581-585; and 

Buchner, et al. 9 (1992) Anal. Biochem., 205: 263-270). Debinski et al. y for example, 
describe the denaturation and reduction of inclusion body polypeptides in guanidine-DTE. 
The polypeptide is then refolded in a redox buffer containing oxidized glutathione and L- 
arginine. 

30 One of skill will recognize that modifications can be made to the 

polypeptides without diminishing their biological activity. Some modifications may be 
made to facilitate the cloning, expression, or incorporation of the targeting molecule into 
a fusion polypeptide. Such modifications are well known to those of skill in the art and 
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include, for example, a methionine added at the amino terminus to provide an initiation 
site, or additional amino acids (e.g. , poly His) placed on either terminus to create 
conveniently located restriction sites or termination codons or purification sequences. 
Making Conservative Modifications of the Nucleic Acids and Polypeptides of the 
5 Invention. 

One of skill will appreciate that many conservative variations of the 
nucleic acid and polypeptide sequences of the figures and sequence listings yield 
functionally identical products. For example, due to the degeneracy of the genetic code, 
"silent substitutions" (i.e., substitutions of a nucleic acid sequence which do not result in 

10 an alteration in an encoded polypeptide) are an implied feature of every nucleic acid 
sequence which encodes an amino acid. Similarly, "conservative amino acid 
substitutions," in one or a few amino acids in an amino acid sequence are substituted 
with different amino acids with highly similar properties (see, the definitions section, 
supra), are also readily identified as being highly similar to a disclosed amino acid 

15 sequence, or to a disclosed nucleic acid sequence which encodes an amino acid. Such 

conservatively substituted variations of each explicitly listed sequence are a feature of the 
present invention. 

One of skill will recognize many ways of generating alterations in a given 
nucleic acid sequence. Such well-known methods include site-directed mutagenesis, PCR 

20 amplification using degenerate oligonucleotides, exposure of cells containing the nucleic 
acid to mutagenic agents or radiation, chemical synthesis of a desired oligonucleotide 
(e.g., in conjunction with ligation and/or cloning to generate large nucleic acids) and 
other well-known techniques. See, Giliman and Smith (1979) Gene 8:81-97, Roberts et 
al. (1987) Nature 328:731-734 and Sambrook, Innis, Ausbel, Berger, Needham 

25 VanDevanter and Mullis (all supra). 

Most commonly, polypeptide sequences are altered by changing the 
corresponding nucleic acid sequence and expressing the polypeptide. However, 
polypeptide sequences are also optionally generated synthetically using commercially 
available peptide synthesizers to produce any desired polypeptide (see, Merrifield, and 

30 Stewart and Young, supra). 

One of skill can select a desired nucleic acid or polypeptide of the 
invention based upon the sequences provided and upon knowledge in the art regarding 
proteins generally. Knowledge regarding the nature of proteins and nucleic acids allows 
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one of skill to select appropriate sequences with activity similar or equivalent to the 
nucleic acids and polypeptides disclosed in the sequence listings herein. The definitions 
section herein describes exemplar conservative amino acid substitutions. 

Finally, most modifications to nucleic acids and polypeptides are evaluated 
by routine screening techniques in suitable assays for the desired characteristic. For 
instance, changes in the immunological character of a polypeptide can be detected by an 
appropriate immunological assay. Modifications of other properties such as nucleic acid 
hybridization to a target nucleic acid, redox or thermal stability of a protein, 
hydrophobicity, susceptibility to proteolysis, or the tendency to aggregate are all assayed 
according to standard techniques. 

Screening for Neisseria meningitidis Nucleic A cids and the Use of Neisseria menin gitidis 
Nucleic Acids as Molecular Probes 

The nucleic acids of the invention are useful as molecular probes, in 
addition to their utility in encoding the polypeptides described herein. A wide variety of 
formats and labels are available and appropriate for nucleic acid hybridization, including 
those reviewed in Tijssen (1993) Laboratory Techniques in biochemistry and molecular 
biology-hybridization with nucleic acid probes parts I and II, Elsevier, New York and 
Choo (ed) (1994) Methods In Molecular Biology Volume 33- In Situ Hybridization 
Protocols Humana Press Inc.; New Jersey (see also, other books in the Methods in 
Molecular Biology series); see especially, Chapter 21 of Choo (id) "Detection of Virus 
Nucleic Acids by Radioactive and Nonisotopic in Situ Hybridization". 

For instance, PCR, LCR, and other amplification techniques (see f supra) 
are routinely used to detect Neisseria meningitidis nucleic acids in biological samples. 
Accordingly, in one class of embodiments, the nucleic acids of the invention are used as 
primers or templates, or as positive controls in amplification reactions for the detection 
of Neisseria meningitidis in a biological samples such as cerebrospinal fluid. Briefly, 
nucleic acids with sequence identity or complementarity to Seq 4 (SEQ ID NO:8), or the 
complement thereof, are used as templates to synthetically produce oligonucleotides of 
about 15-25 nucleotides with sequences similar or identical to the complement of a 
selected Neisseria meningitidis nucleic acid subsequence. The oligonucleotides are then 
used as primers in amplification reactions such as PCR to detect selected Neisseria 
meningitidis nucleic acids in biological samples, such as a cerebrospinal fluid extract. A 
nucleic acid of the invention (i.e., a cloned nucleic acid corresponding to the region to be 
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amplified) is also optionally used as an amplification template in a separate reactions as a 
positive control to determine that the amplification reagents and hybridization conditions 
are appropriate. 

Other methods for the detection of nucleic acids in biological samples 
5 using nucleic acids of the invention include Southern blots, northern blots, in situ 

hybridization (including Fluorescent in situ hybridization (FISH), and a variety of other 
techniques overviewed in Choo (supra)). A variety of automated solid-phase detection 
techniques are also appropriate. For instance, very large scale immobilized polymer 
arrays (VLSIPS™) are used for the detection of nucleic acids. See, Tijssen (supra), 
10 Fodor et al. (1991) Science, 251: 767- 777; Sheldon et aL (1993) Clinical Chemistry 
39(4): 718-719 and Kozal et al. (1996) Nature Medicine 2(7): 753-759. 
Antibodies to selected Neisseria meningitidis ORF polypeptidefs). 

Antibodies are raised to selected Neisseria meningitidis ORF polypeptides 
of the present invention, including individual, allelic, strain, or species variants, and 
15 fragments thereof, both in their naturally occurring (full-length) forms and in 

recombinant forms. Additionally, antibodies are raised to these polypeptides in either 
their native configurations or in non-native configurations. Anti-idiotypic antibodies can 
also be generated. Many methods of making antibodies are known to persons of skill. 
The following discussion is presented as a general overview of the techniques available; 
20 however, one of skill will recognize that many variations upon the following methods are 
known. 

A number of immunogens are used to produce antibodies specifically 
reactive with Neisseria meningitidis ORF 1 (SEQ ID NO:2), ORF 2a (SEQ ID NO:4), 
ORF 2b (SEQ ID NO:5), or ORF 3 (SEQ ID NO: 7) polypeptides. Recombinant or 

25 synthetic polypeptides of 10 amino acids in length, or greater, typically 20 amino acids 
in length, or greater, more typically 30 amino acids in length, or greater, selected from 
amino acid sub-sequences of ORF 1 (SEQ ID NO:2), ORF 2a (SEQ ID NO:4), ORF 2b 
(SEQ ID NO:5), or ORF 3 (SEQ ID NO:7) are the preferred polypeptide immunogen for 
the production of monoclonal or polyclonal antibodies. In one class of preferred 

30 embodiments, an immunogenic peptide conjugate is also included as an immunogen. 
Naturally occurring polypeptides are also used either in pure or impure form. An 
antigenic domain is ordinarily at least about 3 amino acids in length, often at least about 
5 amino acids in length, generally at least about 9 amino acids in length and often at 
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least about 15 amino acids in length. The antigenic domain ordinarily includes the 
binding site for an antibody, which typically vary from 3 to about 20 amino acids in 
length, and which are generally about 8 to 12 amino acids in length. 

Recombinant polypeptides are expressed in eukaryotic or prokaryotic cells 
5 and purified using standard techniques. The polypeptide, or a synthetic version thereof, 
is then injected into an animal capable of producing antibodies. Either monoclonal or 
polyclonal antibodies can be generated for subsequent use in immunoassays to measure 
the presence and quantity of the polypeptide. 

Methods of producing polyclonal antibodies are known to those of skill in 
10 the art. In brief, an immunogen (antigen), preferably a purified polypeptide, a 

polypeptide coupled to an appropriate carrier (e.g., GST, keyhole limpet hemanocyanin, 
etc.), or a polypeptide incorporated into an immunization vector such as a recombinant 
vaccinia virus (see, U.S. Patent No. 4,722,848) is mixed with an adjuvant and animals 
are immunized with the mixture. The animal's immune response to the immunogen 
15 preparation is monitored by taking test bleeds and determining the titer of reactivity to 

the polypeptide of interest. When appropriately high titers of antibody to the immunogen 
are obtained, blood is collected from the animal and antisera are prepared. Further 
fractionation of the antisera to enrich for antibodies reactive to the polypeptide is 
performed where desired (see, e.g., Coligan (1991) Current Protocols in Immunology 
20 Wiley/Greene, NY; and Harlow and Lane (1989) Antibodies: A Laboratory Manual Cold 
Spring Harbor Press, NY). 

Antibodies, including binding fragments and single chain recombinant 
versions thereof, against whole or predetermined fragments of selected Neisseria 
meningitidis ORFs are raised by immunizing animals, e.g., with conjugates of the 
25 fragments with carrier proteins as described above. Typically, the immunogen of interest 
is a peptide of at least about 10 amino acids, more typically the peptide is 20 amino acids 
in length, generally the fragment is 25 amino acids in length and often the fragment is 30 
amino acids in length or greater. The peptides are optionally coupled to a carrier protein 
(e.g., as a fusion protein), or are recombinantly expressed in an immunization vector. 
30 Antigenic determinants on selected Neisseria meningitidis ORF peptides to which 
antibodies bind are typically 3 to 10 amino acids in length. 

Monoclonal antibodies are prepared from cells secreting the desired 
antibody. These antibodies are screened for binding to normal or modified polypeptides, 
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or screened for agonistic or antagonistic activity, e.g., activity mediated through a 
selected Neisseria meningitidis ORF polypeptide. Specific monoclonal and polyclonal 
antibodies will usually bind with a K D of at least about . 1 mM, more usually at least 
about 50 j-iM, and preferably at least about 1 /xM or better. 
5 In some instances, it is desirable to prepare monoclonal antibodies from 

various mammalian hosts, such as mice, rodents, primates, humans, etc. Description of 
techniques for preparing such monoclonal antibodies are found in, e.g. , Stites et al. 
(eds.) Basic and Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, 
CA, and references cited therein; Harlow and Lane, Supra; Goding (1986) Monoclonal 
10 Antibodies: Principles and Practice (2d ed.) Academic Press, New York, NY; and 
Kohler and Milstein (1975) Nature 256: 495-497. Summarized briefly, this method 
proceeds by injecting an animal with an immunogen. The animal is then sacrificed and 
cells taken from its spleen, which are fused with myeloma cells. The result is a hybrid 
cell or "hybridoma" that is capable of reproducing in vitro. The population of 
15 hybridomas is then screened to isolate individual clones, each of which secrete a single 
antibody species to the immunogen. In this manner, the individual antibody species 
obtained are the products of immortalized and cloned single B cells from the immune 
animal generated in response to a specific site recognized on the immunogenic substance. 

Alternative methods of immortalization include transformation with Epstein 
20 Barr Virus, oncogenes, or retroviruses, or other methods known in the art. Colonies 
arising from single immortalized cells are screened for production of antibodies of the 
desired specificity and affinity for the antigen, and yield of the monoclonal antibodies 
produced by such cells is enhanced by various techniques, including injection into the 
peritoneal cavity of a vertebrate (preferably mammalian) host. The polypeptides and 
25 antibodies of the present invention are used with or without modification, and include 
chimeric antibodies such as humanized murine antibodies. 

Other suitable techniques involve selection of libraries of recombinant 
antibodies in phage or similar vectors (see, e.g., Huse et al. (1989) Science 246: 1275- 
1281; and Ward, et al (1989) Nature 341: 544-546; and Vaughan et al. (1996) Nature 
30 Biotechnology, 14: 309-314). 

Frequently, the polypeptides and antibodies will be labeled by joining, 
either covalently or non-covalently, a substance which provides for a detectable signal. 
A wide variety of labels and conjugation techniques are known and are reported 
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extensively in both the scientific and patent literature. Suitable labels include 
radionucleotides, enzymes, substrates, cofactors, inhibitors, fluorescent moieties, 
chemiluminescent moieties, magnetic particles, and the like. Patents teaching the use of 
such labels include U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 
5 4,277,437; 4,275,149; and 4,366,241. Also, recombinant immunoglobulins may be 

produced. See, Cabilly, U.S. Patent No. 4,816,567; and Queen et al. (1989) Proc. Natl 
Acad. Sci. USA 86: 10029-10033. 

The antibodies of this invention are also used for affinity chromatography 
in isolating natural or recombinant Neisseria meningitidis ORF polypeptides. Columns 
10 are prepared, e.g., with the antibodies linked to a solid support, e.g. , particles, such as 
agarose, Sephadex, or the like, where a cell lysate is passed through the column, 
washed, and treated with increasing concentrations of a mild denaturant, whereby 
purified polypeptides are released. 

The antibodies can be used to screen expression libraries for particular 
15 expression products such as normal or abnormal Neisseria meningitidis ORF 

polypeptides, or for related polypeptides related to a selected Neisseria meningitidis ORF 
polypeptide. Optionally, the antibodies in such a procedure are labeled with a moiety 
allowing easy detection of presence of antigen by antibody binding. 

Antibodies raised against polypeptides can also be used to raise anti- 
20 idiotypic antibodies. These are useful for detecting or diagnosing various pathological 
conditions related to the presence of the respective antigens. 

The antibodies of this invention can also be administered to an organism 
(e.g., a human patient) for therapeutic purposes (e.g., to block infection by Neisseria 
meningitidis, or as targeting molecules when conjugated or fused to effector molecules 
25 such as labels, cytotoxins, enzymes, growth factors, drugs, etc.). Antibodies 

administered to an organism other than the species in which they are raised can be 
immunogenic. Thus, for example, murine antibodies administered to a human can 
induce an immunologic response against the antibody (e.g., the human anti-mouse 
antibody (HAMA) response), particularly after multiple administrations. The 
30 immunogenic properties of the antibody are reduced by altering portions, or all, of the 
antibody into characteristically human sequences thereby producing chimeric, or human, 
antibodies respectively. 

Humanized (chimeric) antibodies are immunoglobulin molecules 
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comprising a human and non-human portion. The antigen combining region (or variable 
region) of a humanized chimeric antibody is derived from a non-human source {e.g., 
murine) and the constant region of the chimeric antibody (which confers biological 
effector function, such as cytotoxicity, to the immunoglobulin) is derived from a human 
5 source. The humanized chimeric antibody has the antigen binding specificity of the 

non-human antibody molecule and the effector function conferred by the human antibody 
molecule. A large number of methods of generating chimeric antibodies are well known 
to those of skill in the art {see, e.g., U.S. Patent Nos: 5,502,167, 5,500,362, 5,491,088. 
5,482,856, 5,472,693, 5,354,847, 5,292,867, 5,231,026, 5,204,244, 5,202,238, 

10 5,169,939, 5,081,235, 5,075,431, and 4,975,369). 

In general, the procedures used to produce these chimeric antibodies 
consist of the following steps (the order of some steps interchangeable): (a) identifying 
and cloning the correct gene segment encoding the antigen binding portion of the 
antibody molecule; this gene segment (known as the VDJ, variable, diversity and joining 

15 regions for heavy chains or VJ, variable, joining regions for light chains (or simply as 
the V or Variable region) may be in either the cDNA or genomic form; (b) cloning the 
gene segments encoding the constant region or desired part thereof; (c) ligating the 
variable region with the constant region so that the complete chimeric antibody is 
encoded in a transcribable arid translatable form; (d) ligating this construct into a vector 

20 containing a selectable marker and gene control regions such as promoters, enhancers 

and poly(A) addition signals; (e) amplifying this construct in a host cell (e.g., bacteria); 
and, (f) introducing the DNA into eukaryotic cells (transfection) most often mammalian 
lymphocytes. 

Antibodies of several distinct antigen binding specificities have been 
25 manipulated by these protocols to produce chimeric proteins (e.g., anti-TNP: Boulianne 
et al. (1984) Nature, 312: 643; and anti-tumor antigens: Sahagan et al. (1986) J. 
Immunol, 137: 1066). Likewise, several different effector functions have been achieved 
by linking new sequences to those encoding the antigen binding region. Some of these 
effectors include enzymes (Neuberger et al. (1984) Nature 312: 604), immunoglobulin 
30 constant regions from another species, and constant regions of another immunoglobulin 
chain (Sharon et al (1984) Nature 309: 364; Tan et al.. (1985) J. Immunol. 135: 
3565-3567). 

In one preferred embodiment, a recombinant DNA vector is used to 
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transfect a eel! line that produces an antibody. The novel recombinant DNA vector 
contains a "replacement gene" to replace all or a portion of the gene encoding the 
immunoglobulin constant region in the cell line (e.g. , a replacement gene may encode all 
or a portion of a constant region of a human immunoglobulin, a specific immunoglobulin 
5 class, or an enzyme, a toxin, a biologically active peptide, a growth factor, inhibitor, or 
a linker peptide to facilitate conjugation to a drug, toxin, or other molecule, etc.), and a 
"target sequence" which allows for targeted homologous recombination with 
immunoglobulin sequences within the antibody producing cell. 

In another embodiment, a recombinant DNA vector is used to transfect a 

10 cell line that produces an antibody having a desired effector function, (e.g., a constant 

region of a human immunoglobulin) in which case, the replacement gene contained in the 
recombinant vector may encode all or a portion of a region of an antibody and the target 
sequence contained in the recombinant vector allows for homologous recombination and 
targeted gene modification within the antibody producing cell. In either embodiment, 

15 when only a portion of the variable or constant region is replaced, the resulting chimeric 
antibody may define the same antigen and/or have the same effector function yet be 
altered or improved so that the chimeric antibody may demonstrate a greater antigen 
specificity, greater affinity binding constant, increased effector function, or increased 
secretion and production by the transfected antibody producing cell line, etc. Regardless 

20 of the embodiment practiced, the processes of selection for integrated DNA (via a 

selectable marker), screening for chimeric antibody production, and cell cloning, can be 
used to obtain a clone of cells producing the chimeric antibody. 

Thus, a piece of DNA which encodes a modification for a monoclonal 
antibody can be targeted directly to the site of the expressed immunoglobulin gene within 

25 a B-cell or hybridoma cell line. DNA constructs for any particular modification may be 
used to alter the protein product of any monoclonal cell line or hybridoma. Such a 
procedure circumvents the task of cloning both heavy and light chain variable region 
genes from each B-cell clone expressing a useful antigen specificity. In addition to 
circumventing the process of cloning variable region genes, the level of expression of 

30 chimeric antibody is higher when the gene is at its natural chromosomal location, rather 
than at a random position in the genome. Detailed methods for preparation of chimeric 
(humanized) antibodies can be found in U.S. Patent 5,482,856. 

In another embodiment, this invention provides for fully human antibodies 
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against selected Neisseria meningitidis ORF polypeptides. Human antibodies consist 
entirely of characteristically human immunoglobulin sequences. The human antibodies of 
this invention can be produced in using a wide variety of methods (see, e.g. , Larrick et 
aL, U.S. Pat. No. 5,001,065, for review). 

In one preferred embodiment, the human antibodies of the present 
invention are produced initially in trioma cells. Genes encoding the antibodies are then 
cloned and expressed in other cells, such as nonhuman mammalian cells. 

The general approach for producing human antibodies by trioma 
technology is described by Ostberg et aL (1983), Hybridoma 2: 361-367, Ostberg, U.S. 
Pat. No. 4,634,664, and Engelman et aL, U.S. Pat. No. 4,634,666. The 
antibody-producing cell lines obtained by this method are called triomas because they are 
descended from three cells; two human and one mouse. Triomas have been found to 
produce antibody more stably than ordinary hybridomas made from human cells. 

Preparation of trioma cells requires an initial fusion of a mouse myeloma 
cell line with unimmortalized human peripheral B lymphocytes. This fusion generates a 
xenogeneic hybrid cell containing both human and mouse chromosomes {see, Engelman, 
supra,). Xenogeneic cells that have lost the capacity to secrete antibodies are selected. 
Preferably, a xenogeneic cell is selected that is resistant to a selectable marker such as 
8-azaguanine. Cells possessing resistance to 8-azaguanine are unable to propagate on 
hypoxanthine-aminopterin-thymidine (HAT) or azaserine-hypoxanthine (AH) media. 

The capacity to secrete antibodies is conferred by a further fusion between 
the xenogeneic cell and B-lymphocytes immunized against a selected Neisseria 
meningitidis ORF polypeptide, or an epitope thereof. The B-lymphocytes are obtained 
from the spleen, blood or lymph nodes of human donor. If antibodies against a specific 
antigen or epitope are desired, it is preferable to use that antigen or epitope as the 
immunogen rather than a full length polypeptide. Alternatively, B-lymphocytes are 
obtained from an unimmunized individual and stimulated with a polypeptide, or a epitope 
thereof, in vitro. In a further variation, B-lymphocytes are obtained from an infected, or 
otherwise immunized individual, and then hyperimmunized by exposure to a selected 
Neisseria meningitidis ORF polypeptide for about seven to fourteen days, in vitro. 

The immunized B-lymphocytes prepared by one of the above procedures 
are fused with a xeonogenic hybrid cell by well known methods. For example, the cells 
are treated with 40-50% polyethylene glycol of MW 1000-4000, at about 37°C for about 
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5-10 min. Cells are separated from the fusion mixture and propagated in media selective 
for the desired hybrids. When the xenogeneic hybrid cell is resistant to 8-azaguanine, 
immortalized trioma cells are conveniendy selected by successive passage of cells on 
HAT or AH medium. Other selective procedures are, of course, possible depending on 
5 the nature of the cells used in fusion. Clones secreting antibodies having the required 

binding specificity are identified by assaying the trioma culture medium for the ability to 
bind to a selected Neisseria meningitidis polypeptide or an epitope thereof. Triomas 
producing human antibodies having the desired specificity are subcloned, e.g., by the 
limiting dilution technique, and grown in vitro, in culture medium, or are injected into 

10 selected host animals and grown in vivo. 

The trioma cell lines obtained are then tested for the ability to bind a 
polypeptide or an epitope thereof. Antibodies are separated from the resulting culture 
medium or body fluids by conventional antibody- fractionation procedures, such as 
ammonium sulfate precipitation, DEAE cellulose chromatography and affinity 

15 chromatography. 

Although triomas are genetically stable they do not produce antibodies at 
very high levels. Expression levels can be increased by cloning antibody genes from the 
trioma into one or more expression vectors, and transforming the vector into a cell line 
such as the cell lines typically used for expression of recombinant or humanized 

20 immunoglobulins. As well as increasing yield of antibody, this strategy offers the 

additional advantage that immunoglobulins are obtained from a cell line that does not 
have a human component, and does not therefore need to be subjected to the extensive 
viral screening required for human cell lines. 

The genes encoding the heavy and light chains of immunoglobulins 

25 secreted by trioma cell lines are cloned according to methods, including the polymerase 
chain reaction, known in the art {see, e.g., Sambrook, and Berger & Kimmel, both 
supra). For example, genes encoding heavy and light chains are cloned from a trioma' s 
genomic DNA or cDNA produced by reverse transcription of the trioma's RNA. 
Cloning is accomplished by conventional techniques including the use of PCR primers 

30 that hybridize to the sequences flanking or overlapping the genes, or segments of genes, 
to be cloned. 

Typically, recombinant constructs comprise DNA segments encoding a 
complete human immunoglobulin heavy chain and/or a complete human immunoglobulin 
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light chain of an immunoglobulin expressed by a trioma cell line. Alternatively, DNA 
segments encoding only a portion of the primary antibody genes are produced, which 
portions possess binding and/or effector activities. Other recombinant constructs contain 
segments of trioma cell line immunoglobulin genes fused to segments of other 
immunoglobulin genes, particularly segments of other human constant region sequences 
(heavy and/or light chain). Human constant region sequences can be selected from 
various reference sources, including but not limited to those listed in Kabat et aL (1987), 
Sequences of Proteins of Immunological Interest, U.S. Department of Health and Human 
Services. 

In addition to the DNA segments encoding anti-ORF immunoglobulins or 
fragments thereof, other substantially homologous modified immunoglobulins can be 
readily designed and manufactured utilizing various recombinant DNA techniques known 
to those skilled in the art such as site-directed mutagenesis (see Gillman & Smith 
(1979) Gene, 8: 81-97; Roberts et aL (1987) Nature, 328: 731-734). Such modified 
segments will usually retain antigen binding capacity and/or effector function. 
Moreover, the modified segments are usually not so far changed from the original trioma 
genomic sequences to prevent hybridization to these sequences under stringent 
conditions. Because, like many genes, immunoglobulin genes contain separate functional 
regions, each having one or more distinct biological activities, the genes may be fused to 
functional regions from other genes to produce fusion proteins (e.g., immunotoxins) 
having novel properties or novel combinations of properties. 

The recombinant polynucleotide constructs will typically include an 
expression control sequence operably linked to the coding sequences, including 
naturally-associated or heterologous promoter regions. Preferably, the expression control 
sequences will be eukaryotic promoter systems in vectors capable of transforming or 
transfecting eukaryotic host cells. Once the vector has been incorporated into the 
appropriate host, the host is maintained under conditions suitable for high level 
expression of the nucleotide sequences, and the collection and purification of the human 
immunoglobulins. 

These expression vectors are typically replicable in the host organisms 
either as episomes or as an integral part of the host chromosomal DNA. Commonly, 
expression vectors will contain selection markers, e.g., ampicillin-resistance or 
hygromycin-resistance, to permit detection of those cells transformed with the desired 
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DNA sequences. In general, prokaryotes or eukaryotic cells are used for cloning the 
DNA sequences encoding a human immunoglobulin chain. 

Other approaches include in vitro immunization of human blood. In this 
approach, human blood lymphocytes capable of producing human antibodies are 
produced. Human peripheral blood is collected from the patient and is treated to recover 
mononuclear cells. The suppressor T-cells then are removed and remaining cells are 
suspended in a tissue culture medium to which is added the antigen and autologous serum 
and, preferably, a nonspecific lymphocyte activator. The cells then are incubated for a 
period of time so that they produce the specific antibody desired. The cells then can be 
fused to human myeloma cells to immortalize the cell line, thereby to permit continuous 
production of antibody (see U.S. Patent 4,716,111). 

In another approach, mouse-human hybridomas which produce human 
antibodies are prepared (see, e.g., 5,506,132). Other approaches include immunization 
of mice transformed to express human immunoglobulin genes, and phage display 
screening (Vaughan et al. supra.). 
Cell-Me diated Immune Responses 

In addition to the production of antibodies, the present invention provides 
for cell-mediated immune responses against Neisseria meningitidis. As above, a 
polypeptide of the invention (e.g., ORF 1 (SEQ ID NO:2), ORF 2a (SEQ ID NO:4), 
ORF 2b (SEQ ID NO:5), or ORF 3 (SEQ ID NO: 7), or a subsequence thereof) is 
administered to a mammal. The proliferative effect of these antigens is tested in a 
standard MLR assay. MLR assays or "mixed lymphocyte response" assays are the 
standard in vitro assay of antigen presenting function in cellular immunity. The assay 
measures the proliferation of T cells after stimulation by a selected antigen-presenting 
cell type. The number of T cells produced are typically characterized by measuring T 
cell proliferation based on incorporation of 3 H-thymidine in culture. Similar methods are 
used in vivo in nude or SCID mouse models. See also, Paul (supra) at chapter 31. The 
most commonly measured form of cell-mediated immune response is a cytotoxic T- 
lymphocyte (CTL) response. 

Antigenic peptides are used to elicit CTL ex vivo. The resulting CTL, can 
be used to treat chronic infections in patients that do not respond to other conventional 
forms of therapy, or will not respond to a peptide vaccine approach of therapy. Ex vivo 
CTL responses to a particular pathogen (infectious agent or tumor antigen) are induced 
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by incubating in tissue culture the patient's CTL precursor cells (CTLp) together with a 
source of antigen-presenting cells (APC) and the appropriate immunogenic peptide. 
After an appropriate incubation time (typically 1-4 weeks), in which the CTLp are 
activated and mature and expand into effector CTLs, the ceils are infused back into the 
5 patient, where they will destroy their specific target cell (e.g., an infected cell). 
Detection of Neisseria meningitidis 

As indicated above, Neisseria meningitidis infection causes serious health 
problems, and has the potential to reach epidemic proportions in some populations. 
Accordingly, new methods of detecting infection of patients by Neisseria meningitidis is 
10 of considerable value. 

Thus, it is desirable to determine the presence or absence of Neisseria 
meningitidis in a patient, or to quantify the severity of infection, or quantify the 
expression of Neisseria meningitidis polypeptides or nucleic acids. In addition, the 
polypeptides of the invention are used to detect antisera against the polypeptides, e.g. , in 
15 patients previously infected with Neisseria meningitidis. 

Detection of Neisseria meningitidis or antisera against Neisseria 
meningitidis is accomplished by assaying the products of the Neisseria meningitidis 
nucleic acids of the invention; the nucleic acids themselves, or antibodies against 
polypeptides encoded by the nucleic acids. It is desirable to determine whether 
20 polypeptide expression is present, absent, or abnormal (e.g. because of an abnormal gene 
product or because of abnormal expression). 

The selected Neisseria meningitidis nucleic acid or nucleic acid product 
(i.e., an mRNA or polypeptide) is preferably detected and/or quantified in a biological 
sample. Such samples include, but are not limited to, cerebrospinal fluid, sputum, 
25 amniotic fluid, blood, blood cells (e.g., white cells), tissue or fine needle biopsy 

samples, urine, peritoneal fluid, and pleural fluid, or cells therefrom. Biological samples 
may also include sections of tissues such as frozen sections taken for histological 
purposes. Although the sample is typically taken from a human patient, the assays can 
be used to detect Neisseria meningitidis or Neisseria meningitidis gene products in 
30 samples from any mammal, such as dogs, cats, sheep, cattle, rodents, primates and pigs. 

The sample may be pretreated as necessary by dilution in an appropriate 
buffer solution or concentrated, if desired. Any of a number of standard aqueous buffer 
solutions, employing one of a variety of buffers, such as phosphate, Tris, or the like, at 
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In one embodiment, this invention provides for methods of detecting 
and/or quantifying Neisseria meningitidis gene expression by assaying the underlying 
gene (or a fragment thereof) or by assaying the gene transcript (mRNA). The assay can 
5 be for the presence or absence of the normal gene or gene product, for the presence or 
absence of an abnormal gene or gene product, or quantification of the transcription levels 
of normal or abnormal gene products. 

In a preferred embodiment, nucleic acid assays are performed with a 
sample of nucleic acid isolated from the organism to be tested. In the simplest 
10 embodiment, such a nucleic acid sample is the total mRNA isolated from a biological 
sample. The nucleic acid (e.g., either genomic DNA or mRNA) may be isolated from 
the sample according to any of a number of methods well known to those of skill in the 
art. 

Methods of isolating total DNA or mRNA are well known to those of skill 
15 in the art. For example, methods of isolation and purification of nucleic acids are 

described in detail in Chapter 3 of Laboratory Techniques in Biochemistry and Molecular 
Biology: Hybridization With Nucleic Acid Probes, Part I. Theory and Nucleic Acid 
Preparation, P. Tijssen, ed. Elsevier, N.Y. (1993) and Chapter 3 of Laboratory 
Techniques in Biochemistry and Molecular Biology: Hybridization With Nucleic Acid 
20 Probes, Part I. Theory and Nucleic Acid Preparation, P. Tijssen, ed. Elsevier, N.Y. 
(1993)). 

Frequently, it is desirable to amplify the nucleic acid sample prior to 
hybridization. Methods of "quantitative" amplification are well known to those of skill 
in the art. For example, quantitative PCR involves simultaneously co-amplifying a 

25 known quantity of a control sequence using the same primers. This provides an internal 
standard that may be used to calibrate the PCR reaction. Detailed protocols for 
quantitative PCR are provided in PCR Protocols, A Guide to Methods and Applications, 
Innis et al., Academic Press, Inc. N.Y., (1990). Other suitable amplification methods 
include, but are not limited to those described supra. 

30 Amplification-based assays are well known to those of skill in the art (see, 

e.g., Innis, supra.). The Neisseria meningitidis nucleic acid sequences provided are 
sufficient to teach one of skill to routinely select primers to amplify any portion of the 
gene. It is expected that one of skill is thoroughly familiar with the theory and practice 
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of nucleic acid hybridization and primer selection. Gait, ed. Oligonucleotide Synthesis: 
A Practical Approach, IRL Press, Oxford (1984); W.H.A. Kuijpers Nucleic Acids 
Research 18(17), 5197 (1994); K.L. Dueholm J. Org. Chem. 59, 5767-5773 (1994); S. 
Agrawal (ed.) Methods in Molecular Biology, volume 20; and Tijssen (1993) Laboratory 
Techniques in biochemistry and molecular biology—hybridization with nucleic acid 
probes, e.g., part I chapter 2 "overview of principles of hybridization and the strategy of 
nucleic acid probe assays", Elsevier, New York provide a basic guide to nucleic acid 
hybridization. Innis supra provides an overview of primer selection. In addition, PCR 
amplification products are optionally detected on a polymer array as described in Fodor 
et al. (1991) Science, 251: 767- 777; Sheldon et aL (1993) Clinical Chemistry 39(4): 
718-719, and Kozal et al. (1996) Nature Medicine 2(7): 753-759. 

Most typically, amplification primers are between 8 and 100 nucleotides in 
length, and preferably between about 10 and 30 nucleotides in length. More typically, 
the primers are between about 15 and 25, nucleic acids in length. 

One of skill will recognize that the 3' end of an amplification primer is 
more important for PCR than the 5* end. Investigators have reported PCR products 
where only a few nucleotides at the 3' end of an amplification primer were 
complementary to a DNA to be amplified. In this regard, nucleotides at the 5 1 end of a 
primer can incorporate structural features unrelated to the target nucleic acid; for 
instance, in one preferred embodiment, a sequencing primer hybridization site (or a 
complement to such as primer, depending on the application) is incorporated into the 
amplification primer, where the sequencing primer is derived from a primer used in a 
standard sequencing kit, such as one using a biotinylated or dye-labeled universal M13 or 
SP6 primer. Alternatively, the primers optionally incorporate restriction endonuclease 
sites. The primers are selected so that there is no complementarity between any known 
sequence which is likely to occur in the sample to be amplified and any constant primer 
region. One of skill will appreciate that constant regions in primer sequences are 
optional. 

Typically, all primer sequences are selected to hybridize only to a 
perfectly complementary DNA, with the nearest mismatch hybridization possibility from 
known DNA sequences which are likely to occur in the sample to be amplified having at 
least about 50 to 70% hybridization mismatches, and preferably 100% mismatches for 
the terminal 5 nucleotides at the 3' end of the primer. 



WO 98/17805 



PCT/US97/19424 



32 

The primers are selected so that no secondary structure forms within the 
primer. Self-complementary primers have poor hybridization properties, because the 
complementary portions of the primers self hybridize (i.e., form hairpin structures). The 
primers are also selected so that the primers do not hybridize to each other, thereby 
preventing duplex formation of the primers in solution, and possible concatenation of the 
primers during PCR. If there is more than one constant region in the primer, the 
constant regions of the primer are selected so that they do not self-hybridize or form 
hairpin structures. 

Where sets of amplification primers (i.e., the 5' and 3 1 primers used for 
exponential amplification) are of a single length, the primers are selected so that they 
have roughly the same, and preferably exactly the same overall base composition (i.e., 
the same A+T to G+C ratio of nucleic acids). Where the primers are of differing 
lengths, the A+T to G+C ratio is determined by selecting a thermal melting temperature 
for the primer-DNA hybridization, and selecting an A+T to G+C ratio and probe length 
for each primer which has approximately the selected thermal melting temperature. 

One of skill will recognize that there are a variety of possible ways of 
performing the above selection steps, and that variations on the steps are appropriate. 
Most typically, selection steps are performed using simple computer programs to perform 
the selection as outlined above; however, all of the steps are optionally performed 
manually. One available computer program for primer selection is the Mac Vector 
program from Kodak. In addition to commercially available programs for primer 
selection, one of skill can easily design simple programs for any of the preferred 
selection steps. Amplification primers can be selected to provide amplification products 
that span specific deletions, truncations, and insertions in an amplification target, thereby 
facilitating the detection of specific abnormalities such as a transposon insertion as 
described herein. 

Where it is desired to quantify the transcription level (and thereby 
expression) of a normal or mutated Neisseria meningitidis gene in a sample, the nucleic 
acid sample is one in which the concentration of the mRNA transcript(s) of the gene, or 
the concentration of the nucleic acids derived from the mRNA transcript(s), is 
proportional to the transcription level (and therefore expression level) of that gene. 
Similarly, it is preferred that the hybridization signal intensity be proportional to the 
amount of hybridized nucleic acid. While it is preferred that the proportionality be 
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relatively strict {e.g., a. doubling in transcription rate results in a doubling in mRNA 
transcript in the sample nucleic acid pool and a doubling in hybridization signal), one of 
skill will appreciate that the proportionality can be more relaxed and even non-linear. 
Thus, for example, an assay where a 5 fold difference in concentration of a target 
5 mRNA results in a 3 to 6 fold difference in hybridization intensity is sufficient for most 
purposes. Where more precise quantification is required appropriate controls can be run 
to correct for variations introduced in sample preparation and hybridization as described 
herein. In addition, serial dilutions of "standard" target mRNAs can be used to prepare 
calibration curves according to methods well known to those of skill in the art. Of 

10 course, where simple detection of the presence or absence of a transcript is desired, no 
elaborate control or calibration is required. 

Neisseria meningitidis polypeptide assays. 

The expression of selected Neisseria meningitidis polypeptides can also be 
detected and/or quantified by detecting or quantifying the expressed polypeptide. The 

15 polypeptides can be detected and quantified by any of a number of means well known to 
those of skill in the art. These may include analytic biochemical methods such as 
electrophoresis, capillary electrophoresis, high performance liquid chromatography 
(HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, and the like, 
or various immunological methods such as fluid or gel precipitin reactions, 

20 immunodiffusion (single or double), immunoelectrophoresis, radioimmunoassay (RI A), 
enzyme-linked immunosorbent assays (ELISAs), immunofluorescent assays, western 
blotting, and the like. 

In a particularly preferred embodiment, the polypeptides are detected in an 
electrophoretic protein separation, more preferably in a two-dimensional electrophoresis, 

25 while in a most preferred embodiment, the polypeptides are detected using an 
immunoassay. 

As used herein, an immunoassay is an assay that utilizes an antibody to 
specifically bind to the analyte (e.g., selected polypeptide, such as ORF 1 (SEQ ID 
NO:2), ORF 2a (SEQ ID NO:4), ORF 2b (SEQ ID NO:5), or ORF 3 (SEQ ID NO:7)). 
30 The immunoassay is thus characterized by detection of specific binding of a polypeptide 
to an anti-polypeptide antibody, as opposed to the use of other physical or chemical 
properties to isolate, target, and quantify the analyte. 
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As indicated above, the presence or absence of polypeptides in a biological 
sample can be determined using electrophoretic methods. Means of detecting proteins 
using electrophoretic techniques are well known to those of skill in the art (see generally, 
R. Scopes (1982) Protein Purification, Springer- Verlag, N.Y.; Deutscher, (1990) 
5 Methods in Enzymology Vol 182: Guide to Protein Purification.. Academic Press, Inc., 
N.Y.). 

In a preferred embodiment, the polypeptides are detected and/or quantified 
using any of a number of well recognized immunological binding assays (see, e.g., U.S. 
Patents 4,366,241; 4,376, 1 10; 4,517,288; and 4,837, 168). For a review of the general 

10 immunoassays, see also Methods in Cell Biology Volume 37: Antibodies in Cell Biology, 
Asai, ed.. Academic Press, Inc. New York (1993); Basic and Clinical Immunology 7th 
Edition, Stites & Terr, eds. (1991). Immunological binding assays (or immunoassays) 
typically utilize a "capture agent" to specifically bind to and often immobilize the 
analyte. The capture agent is a moiety that specifically binds to the analyte. In a 

15 preferred embodiment, the capture agent is an antibody that specifically binds 

polypeptide(s) or polypeptide subsequences {e.g., antigenic domains which specifically 
bind to the antibody). In a second preferred embodiment, the capture agent is the 
polypeptide and the analyte is antisera comprising an antibody which specifically binds to 
the polypeptide. 

20 Immunoassays often utilize a labeling agent to specifically bind to and 

label the binding complex formed by the capture agent and the analyte. The labeling 
agent may itself be one of the moieties comprising the antibody/analyte complex. Thus, 
the labeling agent may be a labeled polypeptide or a labeled anti-polypeptide antibody. 
Alternatively, the labeling agent may be a third moiety, such as another antibody, that 

25 specifically binds to the antibody /polypeptide complex. 

In a preferred embodiment, the labeling agent is a second antibody bearing 
a label. Alternatively, the second antibody may lack a label, but it may, in turn, be 
bound by a labeled third antibody specific to antibodies of the species from which the 
second antibody is derived. The second antibody can be modified with a detectable 

30 moiety, such as biotin, to which a third labeled molecule can specifically bind, such as 
enzyme-labeled streptavidin. 

Other proteins capable of specifically binding immunoglobulin constant 
regions, such as streptococcal protein A or protein G may also be used as the label 
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agent. These proteins are normal constituents of the cell walls of streptococcal bacteria. 
They exhibit a strong non-immunogenic reactivity with immunoglobulin constant regions 
from a variety of species {see, generally Kronval, et al. (1973) J. Immunol., Ill: 1401- 
1406, and Akerstrom, et al. (1985)7. Immunol, 135: 2589-2542). 

Throughout the assays, incubation and/or washing steps are optionally 
performed after each combination of reagents. Incubation steps can vary from about 5 
seconds to several hours, preferably from about 5 minutes to about 24 hours. However, 
the incubation time will depend upon the assay format, analyte, volume of solution, 
concentrations, and the like. Usually, the assays will be carried out at ambient 
temperature, although they can be conducted over a range of temperatures, such as 10°C 
to 40°C, 

Immunoassays for detecting polypeptides may be either competitive or 
noncompetitive. Noncompetitive immunoassays are assays in which the amount of 
captured analyte is directly measured. In one preferred "sandwich" assay, for example, 
the capture agent can be bound directly to a solid substrate where they are immobilized. 
These immobilized capture agent then captures analyte present in the test sample. The 
analyte thus immobilized is then bound by a labeling agent, such as a second antibody 
bearing a label. Alternatively, the second antibody may lack a label, but it may, in turn, 
be bound by a labeled third antibody specific to antibodies of the species from which the 
second antibody is derived. The second can be modified with a detectable moiety, such 
as biotin, to which a third labeled molecule can specifically bind, such as enzyme-labeled 
streptavidin. 

In competitive assays, the initial amount of analyte present in the sample is 
measured indirectly by measuring the amount of an added (exogenous) analyte displaced 
(or competed away) from a capture agent by the analyte present in the sample. In one 
competitive assay, a known amount of, in this case, analyte is added to the sample and 
the sample is then contacted with a capture agent. The amount of exogenous analyte 
bound to the capture agent is inversely proportional to the initial analyte present in the 
sample. 

In a preferred embodiment, western blot (immunoblot) analysis is used to 
detect and quantify the presence of selected Neisseria meningitidis in the sample. The 
technique generally comprises separating sample proteins by gel electrophoresis on the 
basis of molecular weight, transferring the separated proteins to a suitable solid support, 
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(such as a nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating 
the sample with the antibodies that specifically bind the selected polypeptide. The 
antibodies specifically bind to polypeptide on the solid support. These antibodies are 
optionally directly labeled or alternatively are optionally subsequently detected using 
5 labeled antibodies {e.g., labeled sheep anti-mouse antibodies) that specifically bind to the 
selected polypeptide. 

Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules {e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals are then detected according to standard 

10 techniques {see, Monroe et al. (1986) Amer. Clin. Prod. Rev. 5:34-41). Enzyme linked 
assays {e.g. , ELISA assays) are also preferred. 

The assays of this invention as scored (as positive or negative for Neisseria 
meningitidis or a selected Neisseria meningitidis polypeptide) according to standard 
methods well known to those of skill in the art. The particular method of scoring will 

15 depend on the assay format and choice of label. For example, a western blot assay can 
be scored by visualizing the colored product produced by the enzymatic label. A clearly 
visible colored band or spot at the correct molecular weight is scored as a positive result, 
while the absence of a clearly visible spot or band is scored as a negative. In a preferred 
embodiment, a positive test will show a signal intensity {e.g., polypeptide quantity) at 

20 least twice that of the background and/or control and more preferably at least 3 times or 
even at least 5 times greater than the background and/or negative control. 

One of skill in the art will appreciate that it is often desirable to reduce 
non-specific binding in immunoassays. Particularly, where the assay involves an antigen 
or antibody immobilized on a solid substrate it is desirable to minimize the amount of 

25 non-specific binding to the substrate. Means of reducing such non-specific binding are 

well known to those of skill in the art. Typically, this involves coating the substrate with 
a proteinaceous composition. In particular, protein compositions such as bovine serum 
albumin (BSA), nonfat powdered milk, and gelatin. 

The particular label or detectable group used in the assay is not a critical 

30 aspect of the invention, so long as it does not significantly interfere with the specific 
binding of the antibody used in the assay. The detectable group can be any material 
having a detectable physical or chemical property. Such detectable labels have been 
well-developed in the field of immunoassays and, in general, most any label useful in 
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such methods can be applied to the present invention. Thus, a label is any composition 
detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, 
optical or chemical means. Useful labels in the present invention include magnetic beads 
(e.g. Dynabeads™), fluorescent dyes (e.g., fluorescein isothiocyanate, texas red, 
5 rhodamine, and the like), radiolabels (e.g., 3 H T "2*1, ^S, ,4 C, or enzymes (e.g. , 

horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), 
and colorimetric labels such as colloidal gold or colored glass or plastic (e.g. 
polystyrene, polypropylene, latex, etc.) beads. 

The label may be coupled directly or indirectly to the desired component 

10 of the assay according to methods well known in the art. As indicated above, a wide 

variety of labels may be used, with the choice of label depending on sensitivity required, 
ease of conjugation with the compound, stability requirements, available instrumentation, 
and disposal provisions. 

Non-radioactive labels are often attached by indirect means. Generally, a 

15 ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds 
to an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable or 
covalently bound to a signal system, such as a detectable enzyme, a fluorescent 
compound, or a chemiluminescent compound. A number of ligands and anti-ligands can 
be used. Where a ligand has a natural anti-ligand, for example, biotin, thyroxine, and 

20 Cortisol, it can be used in conjunction with the labeled, naturally occurring anti-ligands. 
Alternatively, any haptenic or antigenic compound can be used in combination with an 
antibody. 

The molecules can also be conjugated directly to signal generating 
compounds, e.g., by conjugation with an enzyme or fluorophore. Enzymes of interest as 

25 labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, 
or oxidoreductases, particularly peroxidases. Fluorescent compounds include fluorescein 
and its derivatives, rhodamine and its derivatives, dansyl, umbelliferone, etc. 
Chemiluminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, e.g., 
luminol. For a review of various labeling or signal producing systems which may be 

30 used, see, U.S. Patent No. 4,391,904). 

Means of detecting labels are well known to those of skill in the art. 
Thus, for example, where the label is a radioactive label, means for detection include a 
scintillation counter or photographic film as in autoradiography. Where the label is a 
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fluorescent label, it may be detected by exciting the fluorochrome with the appropriate 
wavelength of light and detecting the resulting fluorescence. The fluorescence may be 
detected visually, by means of photographic film, by the use of electronic detectors such 
as charge coupled devices (CCDs) or photomultipliers and the like. Similarly, enzymatic 
5 labels may be detected by providing the appropriate substrates for the enzyme and 
detecting the resulting reaction product. Finally simple colorimetric labels may be 
detected simply by observing the color associated with the label. Thus, in various 
dipstick assays, conjugated gold often appears pink, while various conjugated beads 
appear the color of the bead. 

10 Some assay formats do not require the use of labeled components. For 

instance,. agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, none of the components need be labeled and the presence of 
the target antibody is detected by simple visual inspection. 

15 As mentioned above, depending upon the assay, various components, 

including the antigen, target antibody, or anti-antibody, may be bound to a solid surface. 
Many methods for immobilizing biomolecules to a variety of solid surfaces are known in 
the art. For instance, the solid surface may be a membrane (e.g., nitrocellulose), a 
microtiter dish (e.g., PVC, polypropylene, or polystyrene), a test tube (glass or plastic), 

20 a dipstick (e.g. glass, PVC,. polypropylene, polystyrene, latex, and the like), a 
microcentrifuge tube, or a glass or plastic bead. The desired component may be 
covalently bound or noncovalently attached through nonspecific bonding. 

A wide variety of organic and inorganic polymers, both natural and 
synthetic may be employed as the material for the solid surface. Illustrative polymers 

25 include polyethylene, polypropylene, poiy(4-methylbutene), polystyrene, 

poly methacry late, poly(ethylene terephthalate), rayon, nylon, poly (vinyl butyrate), 
polyvinylidene difluoride (PVDF), silicones, polyformaldehyde, cellulose, cellulose 
acetate, nitrocellulose, and the like. Other materials which may be employed, include 
paper, glasses, ceramics, metals, metalloids, semiconductive materials, cements or the 

30 like. In addition, are included substances that form gels, such as proteins (e.g., 
gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides can be used. 
Polymers which form several aqueous phases, such as dextrans, polyalkylene glycols or 
surfactants, such as phospholipids, long chain (12-24 carbon atoms) alkyl ammonium 
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salts and the like are also suitable. Where the solid surface is porous, various pore sizes 
may be employed depending upon the nature of the system. 

In preparing the surface, a plurality of different materials may be 
employed, particularly as laminates, to obtain various properties. For example, protein 
5 coatings, such as gelatin can be used to avoid non-specific binding, simplify covalent 
conjugation, enhance signal detection or the like. 

If covalent bonding between a compound and the surface is desired, the 
surface will usually be polyfunctional or be capable of being polyfunctionalized. 
Functional groups which may be present on the surface and used for linking can include 
10 carboxylic acids, aldehydes, amino groups, cyano groups, ethylenic groups, hydroxyl 
groups, mercapto groups and the like. The manner of linking a wide variety of 
compounds to various surfaces is well known and is amply illustrated in the literature. 
See, for example, Immobilized Enzymes, Ichiro Chibata, Halsted Press, New York, 1978, 
and Cuatrecasas (1970) J. Biol. Chem. 245 3059). 
15 In addition to covalent bonding, various methods for noncovalently binding 

an assay component can be used. Noncovalent binding is typically nonspecific 
absorption of a compound to the surface. Typically, the surface is blocked with a second 
compound to prevent nonspecific binding of labeled assay components. Alternatively, 
the surface is designed such that it nonspecifically binds one component but does not 
20 significantly bind another. For example, a surface bearing a lectin such as Concanavalin 
A will bind a carbohydrate containing compound but not a labeled protein that lacks 
glycosylation. Various solid surfaces for use in noncovalent attachment of assay 
components are reviewed in U.S. Patent Nos. 4,447,576 and 4,254,082. 
Detection kits 

25 The present invention also provides kits for the diagnosis of patients 

infected with Neisseria meningitidis. The kits preferably include one or more reagents for 
determining the presence or absence of a selected Neisseria meningitidis nucleic acid or 
protein, i.e., any of the nucleic acids or proteins described herein. Preferred reagents 
include nucleic acid probes that specifically bind to Seq 1 (SEQ ID NO:l), Seq 2 (SEQ 

30 ID NO:3), Seq 3 (SEQ ID NO:6), or Seq 4 (SEQ ID NO:8); cDNA corresponding to 

Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), Seq 3 (SEQ ID NO:6), or Seq 4 (SEQ ID 
NO: 8), or a subsequence thereof; probes that specifically bind to an abnormal Neisseria 
meningitidis gene (e.g., one containing premature truncations, insertions, or deletions), 
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and antibodies that specifically bind to polypeptides or subsequences thereof. The 
antibody or hybridization probe may be free or immobilized on a solid support such as a 
test tube, a microtiter plate, a dipstick or the like. The kit may also contain instructional 
materials teaching the use of the antibody or hybridization probe in an assay for the 
5 detection of Neisseria meningitidis , a container or other packaging material or the like. 

The kits may include alternatively, or in combination with any of the other 
components described herein, an antibody which specifically binds a polypeptide of the 
invention. The antibody can be monoclonal or polyclonal. The antibody can be 
conjugated to another moiety such as a label and/or it can be immobilized on a solid 
10 support (substrate). 

The kits also optionally include a second antibody for detection of 
polypeptide/antibody complexes or for detection of hybridized nucleic acid probes. The 
kits optionally include appropriate reagents for detection of labels, positive and negative 
controls, washing solutions, dilution buffers and the like. 
15 Intracellul ar Immunization and Gene Therapy 

In one preferred class of embodiments, the nucleic acids of the invention 
are used in cell transformation procedures for intracellular immunization and gene 
therapy to inhibit or prevent meningitis caused by Neisseria meningitidis serogroup B. 
Gene therapy provides methods for combating chronic infectious diseases. In vitro, ex 
20 vivo and in vivo procedures. are used. The nucleic acids of the invention optionally 

encode antisense oligonucleotides which bind to selected Neisseria meningitidis nucleic 
acids {e.g., RNAs encoded by Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), Seq 3 
(SEQ ID NO:6), or Seq 4 (SEQ ID NO:8); see. Figures 5, 6, 7 and 4, respectively) with 
high affinity. These oligonucletides are typically cloned into gene therapy vectors that 
25 are competent to transform cells (such as human or other mammalian cells) in vitro 
and/or in vivo. 

Several approaches for introducing nucleic acids into cells in vivo, ex vivo 
and in vitro have been used. These include liposome based gene delivery (Debs and Zhu 
(1993) WO 93/24640; Mannino and Gould-Fogerite (1988) BioTechniques 6(7): 682-691; 
30 Rose U.S. Pat No. 5,279,833; Brigham (1991) WO 91/06309; and Feigner et aL (1987) 
Proc. Natl. Acad. Sci. USA 84: 7413-7414) and replication-defective retroviral vectors 
harboring a therapeutic polynucleotide sequence as part of the retroviral genome (see, 
e.g., Miller et aL (1990) Mol. Cell. Biol. 10:4239 (1990); Koiberg (1992) J. NIH Res. 
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4:43, and Cornetta et aL Hum. Gene Ther. 2:215 (1991)). 

For a review of gene therapy procedures, see, Anderson, Science (1992) 
256:808-813; Nabel and Feigner (1993) T1BTECH 11: 211-217; Mitani and Caskey 
(1993) TIBTECH 11: 162-166; Mulligan (1993) Science 926-932; Dillon (1993) 
TIBTECH 11: 167-175; Miller (1992) Nature 357: 455-460; Van Brunt (1988) 
Biotechnology 6(10): 1149-1154; Vigne (1995) Restorative Neurology and Neuroscience 
8: 35-36; Kremer and Perricaudet (1995) British Medical Bulletin 51(1) 31-44; Haddada 
et aL (1995) in Current Topics in Microbiology and Immunology Doerfler and Bohm 
(eds) Springer- Verlag, Heidelberg Germany; and Yu et aL, Gene Therapy (1994) 1:13- 
26. 

Widely used retroviral vectors include those based upon murine leukemia 
virus (MuLV), gibbon ape leukemia virus (GaLV), Simian Immuno deficiency virus 
(SIV), human immuno deficiency virus (HIV), and combinations thereof. See, e.g., 
Buchscher et al. (1992) J. Virol. 66(5) 2731-2739; Johann et al. (1992) J. Virol. 66 
(5):1635-1640 (1992); Sommerfelt etaL, (1990) Virol. 176:58-59; Wilson et aL (1989) 
J. Virol. 63:2374-2378; Miller et al., J. Virol. 65:2220-2224 (1991); Wong-Staal et al., 
PCT/US94/05700, and Rosenburg and Fauci (1993) in Fundamental Immunology, Third 
Edition Paul (ed) Raven Press, Ltd., New York and the references therein, and Yu et 
aL, Gene Therapy (1994) supra). The vectors are optionally psuedotyped to extend the 
host range of the vector to cells which are not infected by the retrovirus corresponding to 
the vector. The vesicular stomatitis virus envelope glycoprotein (VSV-G) has been used 
to construct VSV-G-pseudotyped HIV vectors which can infect hematopoietic stem cells 
(Naldini et al. (1996) Science 272:263, and Akkina et aL (1996) / Virol 70:2581). 

Adeno-associated virus (AAV)-based vectors are also used to transduce 
cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and 
peptides, and in in vivo and ex vivo gene therapy procedures. See, West et al. (1987) 
Virology 160:38-47; Carter et aL (1989) U.S. Patent No. 4,797,368; Carter et al. WO 
93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801; Muzyczka (1994) J. 
Clin, lnvst. 94:1351 for an overview of AAV vectors. Construction of recombinant 
AAV vectors are described in a number of publications, including Lebkowski, U.S. Pat. 
No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5(1 1):325 1-3260; Tratschin, et 
al. (1984) MoL Cell. Biol., 4:2072-2081; Hermonat and Muzyczka (1984) Proc. Natl. 
Acad. Sci. USA, 81:6466-6470; McLaughlin et al. (1988) and Samulski et al. (1989) J. 
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Virol, 63:03822-3828. Cell lines that can be transformed by rAAV include those 
described in Lebkowski et al (1988) Mol Cell Biol, 8:3988-3996. 

Ex vivo methods for inhibiting Neisseria meningitidis replication in a cell 
in an organism involve transducing the cell ex vivo with a nucleic acid of this invention 
5 which expresses an antisense oligonucleotide of the invention, and introducing the cell 
into the organism. The culture of cells used in conjunction with the present invention, 
including cell lines and cultured cells from tissue or blood samples is well known in the 
art. Freshney (Culture of Animal Cells, a Manual of Basic Technique, third edition 
Wiley-Liss, New York (1994)) and the references cited therein provides a general guide 

10 to the culture of cells. Transformed ceils are cultured by means well known in the art. 
See, also.Kuchl&r et al (1977) Biochemical Methods in Cell Culture and Virology, 
Kuchler, R.J., Dowden, Hutchinson and Ross, Inc., and Atlas (1993) CRC Handbook of 
Microbiological Media (Parks ed) CRC press, Boca Raton, Fl. Mammalian cell systems 
often will be in the form of monolayers of cells, although mammalian cell suspensions 

15 are also used. Alternatively, cells can be derived from those stored in a cell bank (e.g., 
a blood bank). 

In one preferred use of the invention, expression of an oligonucleotide 
inhibits Neisseria meningitidis replication in any of those cells already infected with 
Neisseria meningitidis, in addition to conferring a protective effect to cells which are not 

20 infected. Thus, an organism infected with Neisseria meningitidis can be treated for the 
infection by transducing a population of its cells with a vector of the invention and 
introducing the transduced cells back into the organism. Thus, the present invention 
provides a method of protecting cells in vitro, ex vivo or in vivo, even when the cells are 
already infected with the virus against which protection is sought. 

25 A ribozyme is a catalytic antisense RNA molecule that cleaves other RNA 

molecules having particular target nucleic acid sequences. General methods for the 
construction of ribozymes against selected targets, including hairpin ribozymes, 
hammerhead ribozymes, RNAse P ribozymes (i.e., ribozymes derived from the naturally 
occurring RNAse P ribozyme from prokaryotes or eukaryotes) are known in the art. 

30 Castanotto et al (1994) Advances in Pharmacology 25: 289-317 provides and overview of 
ribozymes in general, including group I ribozymes, hammerhead ribozymes, hairpin 
ribozymes, RNAse P, and axhead ribozymes. 

Briefly, two types of ribozymes that are particularly useful in this 
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invention include the hairpin ribozyme and the hammerhead ribozyme. The hammerhead 
ribozyme (see, Rossie et al. (1991) Pharmac. Then 50:245-254; Forster and Symons 
(1987) Cell 48:211-220; Haseloff and Gerlach (1988) Nature 328:596-600; Walbot and 
Bmening (1988) Nature 334:196; Haseloff and Gerlach (1988) Nature 334:585; and 
5 Dropulic et al and Castanotto et aL, and the references cited therein, supra) and the 

hairpin ribozyme (see, e.g., Hampel et al. (1990) Nucl. Acids Res. 18:299-304; Hempel 
et aL, (1990) European Patent Publication No. 0 360 257; U.S. Patent No. 5,254,678; 
Wong-Staal et aL, PCT/US94/05700; Ojwang et al. (1993) Proc Natl Acad Sci USA 
90:6340-6344; Yamada et aL (1994) Human Gene Therapy 1:39-45; Leavitt et al. (1995) 

10 Proc Natl Acad Sci USA 92:699-703; Leavitt et al. (1994) Human Gene Therapy 

5:1151-1120; and Yamada et al. (1994) Virology 205:121-126) are catalytic molecules 
having antisense and endoribonucleotidase activity. 

The typical sequence requirement for the GUC hairpin ribozyme is a RNA 
sequence consisting of NNNG/CN*GUCNNNNNNNN (SEQ ID NO: 9) (where N*G is 

15 the cleavage site, and where N is any of G, U, C, or A). The sequence requirement at 
the cleavage site for the hammerhead ribozyme is any RNA sequence consisting of NUX 
(where N is any of G, U, C, or A and X represents C, U or A). Accordingly, the same 
target within the hairpin leader sequence, GUC, is targetable by the hammerhead 
ribozyme. The additional nucleotides of the hammerhead ribozyme or hairpin ribozyme 

20 is determined by the common target flanking nucleotides and. e.g., the hammerhead 
consensus sequences. 

Altman (1995) Biotechnology 13: 327-329 and the references therein 
describe the use of RNAse P as a therapeutic agent directed against flu virus. Similar 
therapeutic approaches can be used against selected Neisseria meningitidis RNAs by 

25 incorporating RNAse P catalytic domains into the antisense molecules of the invention. 

The anti sense molecules, including the ribozymes of this invention and 
DNA encoding the ribozymes, can be chemically synthesized as described supra, or 
prepared from a DNA molecule (that upon transcription yields an RNA molecule) 
operably linked to an appropriate promoter. 

30 Reporter genes. Sites of Replication and Selectable Markers 

To monitor the progress of cellular transduction, a marker or "reporter" 
gene is optionally encoded by the nucleic acids of the invention. The inclusion of 
detectable markers provides a means of monitoring the infection and stable transduction 
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of target cells. Markers include components of the beta-galactosidase gene, the firefly 
luciferase gene and the green fluorescence protein {see, e.g., Chalfie et al. (1994) 
Science 263:802). 

The vectors of the invention optionally include features which facilitate the 
5 replication in more than one cell type. For example, the replication of a plasmid as an 
episomal nucleic acid in mammalian cells can be controlled by the large T antigen in 
conjunction with an appropriate origin of replication, such as the origin of replication 
derived from the BK papovavirus. Many other features which permit a vector to be 
grown in multiple cell types (e.g., shuttle vectors which are replicated in prokaryotic and 
10 eukaryotic cells) are known. 

Selectable markers which facilitate cloning of the vectors of the invention 
are optionally included. Sambrook and Ausbel, both supra, provide an overview of 
selectable markers. 

The present invention provides nucleic acids for the transformation 

15 of cells in vitro and in vivo. These nucleic acids are typically packaged in vector 
particles. The nucleic acids are transfected into cells through the interaction of the 
vector particle surrounding the nucleic acid and the cellular receptor for the vector. For 
example, cells which are transfected by HIV based vectors in vitro include CD4+ cells, 
including T-cells such as Molt-4/8 cells, SupTl cells, H9 cells, C8166 cells and 

20 myelomonocytic (U937) cells, as well as primary human lymphocytes, and primary 
human monocyte-macrophage cultures, peripheral blood dendritic cells, follicular 
dendritic cells, epidermal Langerhans cells, megakaryocytes, microglia, astrocytes, 
oligodendroglia, CD8+ cells, retinal cells, renal epithelial cells, cervical cells, rectal 
mucosa, trophoblastic cells, and cardiac myocytes (see also, Rosenburg and Fauci 

25 Rosenburg and Fauci (1993) in Fundamental Immunology, Tfiird Edition Paul (ed) Raven 
Press, Ltd., New York). AAV based vectors transduce most mammalian cells. In one 
particularly preferred class of embodiments, the nucleic acids of the invention are used in 
cell transformation procedures for gene therapy. 

In addition to viral particles, a variety of protein coatings can be used to 

30 target nucleic acids to selected cell types. Transferrin-poly-cation conjugates enter cells 
which comprise transferrin receptors. See, e.g., Zenke et al (1990) Proc. Natl. Acad. 
ScL USA 87: 3655-3659; Curiel (1991) Proc. Natl. Acad Sci USA 88: 8850-8854 and 
Wagners al. (1993) Proc. Natl. Acad. Sci. USA 89:6099-6013. 
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Naked plasmid DNA bound electrostatically to poly-l-lysine or 
poly-l-lysine-transferrin which has been linked to defective adenovirus mutants can be 
delivered to cells with transection efficiencies approaching 90% (Curiel et aL (1991) 
Proc Natl Acad Sci USA 88:8850-8854; Cotten et ai (1992) Proc Natl Acad Sci USA 
5 89:6094-6098; Curiel et aL (1992) Hum Gene Ther 3:147-154; Wagner et aL (1992) 
Proc Natl Acad Sci USA 89:6099-6103; Michael et aL (1993) J Biol Chem 
268:6866-6869; Curiel et aL (1992) Am J Respir Cell Mol Biol 6:247-252, and Harris et 
aL (1993) Am J Respir Cell Mol Biol 9:441-447). The adenovirus-poly-l-Iysine-DNA 
conjugate binds to the normal adenovirus receptor and is subsequently internalized by 

10 receptor-mediated endocytosis. The adenovirus-poly-l-lysine-DNA conjugate binds to the 
normal adenovirus receptor and is subsequently internalized by receptor-mediated 
endocytosis. Similarly, other virus-poly-l-lysine-DNA conjugates bind the normal viral 
receptor and are subsequently internalized by receptor- mediated endocytosis. 
Accordingly, a variety of viral particles can be used to target vector nucleic acids to 

15 cells. 

In addition to, or in place of receptor-ligand mediated transduction, the 
vector nucleic acids of the invention are optionally complexed with liposomes to aid in 
cellular transduction. Liposome based gene delivery systems are described in Debs and 
Zhu (1993) WO 93/24640; Mannino and Gould-Fogerite (1988) BioTechniques 6(7): 682- 
20 691; Rose U.S. Pat No. 5,279,833; Brigham (1991) WO 91/06309; and Feigner et aL 
(1987) Proc. NatL Acad. ScL USA 84: 7413-7414. 
Ex Vivo Transduction of Cells 

Ex vivo methods for inhibiting viral replication in a cell in an organism 
involve transducing the cell ex vivo with a therapeutic nucleic acid of this invention, and 
25 introducing the cell into the organism. The cells are typically isolated or cultured from a 
patient. Alternatively, the cells can be those stored in a cell bank (e.g., a blood bank). 

In one class of embodiments, the vectors of the invention inhibit Neisseria 
meningitidis replication in cells already infected with Neisseria meningitidis, in addition 
to conferring a protective effect to cells which are not infected by Neisseria meningitidis. 
30 Thus, an organism infected with Neisseria meningitidis can be treated for the infection by 
transducing a population of its cells with a vector encoding an antisense molecule against 
a selected Neisseria meningitidis RNA and introducing the transduced cells back into the 
patient as described herein. Thus, the present invention provides compositions and 
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methods for protecting cells in culture, ex vivo and in a patient, even when the cells are 
already infected with the Neisseria meningitidis. 

The culture of cells used in conjunction with the present invention, 
including cell lines and cultured cells from tissue or blood samples is well known in the 
5 art. Freshney {Culture of Animal Cells, a Manual of Basic Technique, third edition 

Wiley-Liss, New York (1994)) and the references cited therein provides a general guide 
to the culture of cells. Transduced cells are cultured by means well known in the art. 
See, also Kuchler et al. (1977) Biochemical Methods in Cell Culture and Virology, 
Kuchler, R.J., Dowden, Hutchinson and Ross, Inc. Mammalian cell systems often will 

10 be in the form of monolayers of cells, although mammalian cell suspensions are also 
used. Illustrative examples of mammalian cell lines include the HEC-l-B cell line, 
VERO and Hela cells, Chinese hamster ovary (CHO) cell lines, W138, BHK, Cos-7 or 
MDCK cell lines (see, e.g., Freshney, supra). 

In one embodiment, CD34+ stem cells are optionally used in ex-vivo 

15 procedures for cell transduction and gene therapy. The advantage to using stem cells is 
that they can be introduced into a mammal (such as the donor of the cells) where they 
will engraft in the bone marrow and differentiate into many different immune cell types. 

In humans, CD34+ cells can be obtained from a variety of sources 
including cord blood, bone marrow, and mobilized peripheral blood. Purification of 

20 CD34+ ceils can be accomplished by antibody affinity procedures. An affinity column 
isolation procedure for isolating CD34+ cells is described by Ho et al. (1995) Stem Cells 
13 (suppl. 3): 100-105. See also, Brenner (1993) Journal of Hematotherapy 2: 7-17. 
Yu et al. (1995) PNAS 92: 699-703 describe a method of transducing CD34+ cells from 
human fetal cord blood using retroviral vectors. 

25 Rather than using stem cells, T cells or B cells are also used in some 

embodiments in ex vivo procedures. Several techniques are known for isolating T and B 
cells. The expression of surface markers facilitates identification and purification of such 
cells. Methods of identification and isolation of cells include FACS, incubation in flasks 
with fixed antibodies which bind the particular cell type and panning with magnetic 

30 beads. 

Administration of Nucleic Acids. Gene Therapy Vectors. Immunogenic Compositions 
and Transduced Cells 

Nucleic acids (typically DNA) encoding the polypeptides of the invention 
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are administered to patients to elicit an immune response against the polypeptides which 
they encode. DNA administered for this purpose is referred to as a U DNA vaccine." 
Methods of making and administering DNA as vaccines are known, and described, e.g., 
in Wolff et. al. y Science 247: 1465-1468 (1990). The nucleic acids of the invention, 
5 including antisense molecules, are also optionally administered to inhibit Neisseria 
meningitidis replication in cells transduced by the vectors, as described supra. 

In another aspect, the present invention is directed to administration of 
immunogenic compositions and vaccines which contain as an active ingredient an 
immunogenically effective amount of an immunogenic peptide as described herein. The 

10 peptide(s) may be introduced into a mammal, including a human. The peptide is 

optionally linked to a carrier, or is present as a homopolymer or heteropolymer of active 
peptide units. Polymerization of multiple units of the polypeptides of the invention 
provides the advantage of increased immunological reaction and, where different peptides 
are used to make up the polymer, the additional ability to induce antibodies and/or CTLs 

15 that react with different antigenic determinants of the virus or tumor cells. Useful 

carriers are well known in the art, and include, e.g., thyroglobulin, albumins such as 
human serum albumin, tetanus toxoid, polyamino acids such as poly (lysine: glutamic 
acid), influenza, hepatitis B virus core protein, hepatitis B virus recombinant vaccine and 
the like. The vaccines can also contain a physiologically tolerable (acceptable) diluent or 

20 excipient such as water, phosphate buffered saline, or saline. The vaccines and- 
immunogenic compositions of the invention further typically include an adjuvant. 
Adjuvants such as incomplete Freund's adjuvant, aluminum phosphate, aluminum 
hydroxide, or alum are materials well known in the art. CTL responses can be primed 
by conjugating peptides of the invention to lipids. Upon immunization with a peptide 

25 composition as described herein, the immune system of the host responds to the vaccine 
by producing antibodies and CTLs specific for the desired antigen, making the host 
resistant to later infection by Neisseria meningitidis, or resistant to developing chronic 
infection. In addition to the polypeptides herein, known Neisseria meningitidis 
immunogens are optionally present in any immunogenic or vaccine composition, thereby 

30 providing an immune response against the both peptides of the invention and known 
polypeptides. 

For therapeutic or immunization purposes, the peptides of the invention 
can also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This 
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approach involves the use of vaccinia virus as a vector to express nucleotide sequences 
that encode the peptides of the invention. Upon introduction into an acutely or 
chronically infected host or into a non-infected host, the recombinant vaccinia virus 
expresses the immunogenic peptide, and thereby elicits a host CTL response. Vaccinia 
vectors and methods useful in immunization protocols are described in, e.g., U.S. Patent 
No. 4,722,848, incorporated herein by reference. Another vector is BCG (Bacille 
Calmette Guerin). BCG vectors are described in Stover et al. (1991) Nature 
351:456-460. A wide variety of other vectors useful for therapeutic administration or 
immunization with the peptides of the invention, e.g., Salmonella typhi vectors and the 
like, will be apparent to those skilled in the art from the description herein. 

Accordingly, the present invention provides for administration of nucleic 
acids {e.g., DNA vaccines or ceil transformation vectors), polypeptides, immunogenic 
compositions comprising a polypeptide, vaccine components, and transduced cells {e.g., 
those made in ex vivo gene therapy or CTL procedures). Administration is by any of the 
routes normally used for introducing a molecule into ultimate contact with blood or tissue 
cells. Administration is made in any suitable manner, preferably with pharmaceutical^ 
acceptable carriers. Suitable methods of administering nucleic acids, proteins, vaccines, 
cells and immunogenic compositions in the context of the present invention to a patient 
are available. Intra-muscular and subcutaneous administration is appropriate for, e.g., 
vaccines, DNA vaccines, and immunogenic compositions, Parenteral administration such 
as intravenous administration is a suitable method of administration for transduced cells 
and cell transformation vectors. Formulations of compositions to be administered can be 
presented in unit-dose or multi-dose sealed containers, such as ampules and vials. 

Pharmaceutical^ acceptable excipients are determined in part by the 
particular composition being administered, as well as by the particular method used to 
administer the composition. Accordingly, there is a wide variety of suitable formulations 
of pharmaceutical compositions of the present invention. Formulations suitable for 
parenteral administration, such as, for example, by intraarticular (in the joints), 
intravenous, intramuscular, intradermal, intraperitoneal, and subcutaneous routes, include 
aqueous and non-aqueous, isotonic sterile injection solutions, which can contain 
antioxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with 
the blood of the intended recipient, and aqueous and non-aqueous sterile suspensions that 
can include suspending agents, solubilizers, thickening agents, stabilizers, and 
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preservatives. 

The dose administered to a patient, in the context of the present invention 
should be sufficient to effect a beneficial therapeutic response in the patient over time, 
such as a reduction in the level of Neisseria meningitidis, or to inhibit infection by 
5 Neisseria meningitidis. The dose will be determined by the efficacy of the particular 
vector, nucleic acid or immunogenic composition employed and the condition of the 
patient, as well as the body weight or surface area of the patient to be treated. The size 
of the dose also will be determined by the existence, nature, and extent of any adverse 
side-effects that accompany the administration of a particular vector, or transduced cell 

10 type in a particular patient. 

In determining the effective amount of the nucleic acid, immunogenic 
composition or vector to be administered in the treatment or prophylaxis against 
Neisseria meningitidis, the physician evaluates circulating plasma levels, vector and 
therepeutic moeity (e.g., anti- Nesseria mRNA ribozyme) toxicities, progression of the 

15 disease, and the production of znti-Neisseria meningitidis antibodies. 

For administration, vectors, nucleic acids, immunogenic compositions and 
transduced cells of the present invention can be administered at a rate determined by the 
LD-50 of the vector, immunogenic composition, or transduced cell type, and the side- 
effects of the vector, nucleic acid, immunogenic composition, or cell type at various 

20 concentrations, as applied to the mass and overall health of the patient. Administration 
can be accomplished via single or divided doses. For a typical 70 kg patient, a dose 
equivalent to approximately . l/*g to 10 mg of vector or nucleic acid are administered. A 
dose of about . 1/ig to 10 mg of most immunogenic compositions will suffice to elicit a 
protective immune response against Neisseria meningitidis. In the case of immunogenic 

25 compositions, booster inoculations of the immunogenic composition are occasionally 

needed, Such booster inoculations are typically administered from once every 5 years up 
to about four times per year. The need for a booster inoculation can be determined by 
measuring the level of zx\\i- Neisseria meningitidis titer in the serum of the inoculated 
individual. 

30 Transduced cells are optionally prepared for reinfusion according to 

established methods. See, Abrahamsen et al (1991) 7. Clin. Apheresis 6:48-53; Carter 
et aL (1988) /. Clin. Apheresis 4:113-117; Aebersold et al. (1988), J. Immunol Methods 
112: 1-7; Muul et al (1987)7. Immunol Methods 101:171-181 and Carter et al (1987) 
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Transfusion 27:362-365. In one class of ex vivo procedures, between 1 X 10 6 and 1 X 
10 9 transduced cells (e.g., stem cells, T cells or B cells transduced with vectors encoding 
a nucleic acid of the invention) are infused intravenously, e.g., over 60-200 minutes. 
Vital signs and oxygen saturation by pulse oximetry are closely monitored. Blood 
samples are obtained 5 minutes and 1 hour following infusion and saved for subsequent 
analysis. Leukopheresis, transduction and reinfusion may be repeated about every 2 to 3 
months for a total of 4 to 6 treatments in a one year period. After the first treatment, 
infusions can be performed on a outpatient basis at the discretion of the clinician. 

If a patient undergoing infusion of a vector, immunogenic composition, or 
transduced cell develops fevers, chills, or muscle aches, he/she typically receives the 
appropriate dose of aspirin, ibuprofen or acetaminophen. Patients who experience 
reactions to the infusion such as fever, muscle aches, and chills are premedicated 30 
minutes prior to the future infusions with either aspirin, acetaminophen, or 
diphenhydramine. Meperidine is used for more severe chills and muscle aches that do 
not quickly respond to antipyretics and antihistamines. Cell infusion is slowed or 
discontinued depending upon the severity of the reaction. 

The effect of the therapeutic vectors, immunogenic compositions, or 
transduced cells of the invention on Neisseria meningitidis infection and meningitis are 
measured by monitoring the level of Neisseria meningitidis in a patient, or by monitoring 
the ami- Neisseria meningitidis antibody count for the patient over time. Typically, 
measurements are taken before, during and after the therapeutic or prophylactic regimen. 

EXAMPLES 

The following examples are provided by way of illustration only and not 
by way of limitation. Those of skill will readily recognize a variety of noncritical 
parameters which can be changed or modified to yield essentially similar results. 
Example 1: ORF 1 fSEQ ID NO:2 ORF 2 (ORF 2a TSEO ID NOA). ORF 2b (SEP ID 
NO;5» and ORF 3 fSEO ID NQ:7^ and Invasion Deficient Strains of Neisseria 
meningitidis 

Several hundred N. meningitidis serogroup B, strain NMB, Tn916 
transposon mutants were screened for an increased or decreased ability to attach or 
invade human endometrial tissue culture (HEC-l-B) cells. Using this approach, we 
identified and characterized a mutant, VVV6, which showed a > 10-fold decrease in its 
ability to invade HEC-l-B cells compared to the parent NMB strain and to an additional 
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well characterized capsule deficient mutant, M7, (Stephens, D.S., et al (1991), Infect. 
Immun., 59:4097-4102) (Figure 2). The results obtained from growth curves and the 
various controls used in the attachment-invasion assays revealed no significant difference 
in the growth rate between NMB and VVV6. The results of lipooligosaccharide (LOS) 
analysis from strains NMB, Dl, M7, and VVV6 show that strain Dl cannot sialylate 
LOS; this deficiency resulted in the loss of the sialilyted LOS band. There are no 
detectable differences in the LOS profiles from NMB and VVV6. One dimensional SDS- 
PAGE analysis of outer membrane protein demonstrated that VVV6 had an identical 
profile to the parent strain. Electron microscopic analysis showed no difference in the 
quantity or morphology of the observable pili between NMB and VVV6. 

Southern analysis on VVV6 genomic DNA digested with EcoKL, HindllL 
and Sau3Al hybridized with a transposon specific probe showed band patterns consistent 
with that of chromosomal DNA that contains only one copy of the transposon (Swartley, 
J.S., et al, (1993), Mol Microbiol., 10:299-310). NMB has no TnP/6 transposon 
inserted in the chromosome, and as expected there is no band observed in the Southern 
hybridization. In addition, DNA sequence analysis showed that the transposon insertion 
is of the Class 1 type (Hitchcock et al. (1983)); the entire transposon is inserted and 
stably maintained in the host genome (Swartley, J.S., et al, (1993), MoL Microbiol, 
10:299-310). 

Linkage of the mutant phenotype with the location of the transposon 
insertion was demonstrated by homologous recombination experiments. Transformation 
of the parent strain with genomic DNA from mutant VVV6 yielded recombinants that 
showed the mutant phenotype when tested on the tissue culture monolayer (Figure 3). 
Tetracycline resistant back-transformants were obtained at a frequency of 1.3x10 
VmgDNA. A total of seven recombinants were tested, all of which showed a decreased 
ability to invade HEC-l-B ceils. The polymerase chain reaction and DNA sequence 
analysis were used to determine the location of the transposon insertion in each of the 
transformants. The results showed that the transposon insertions occurred in the exact 
same position observed in VVV6. 

Nucleotide sequence analysis on a 5kb fragment showed that the TnP76 
insertion occurred between two open reading frames (Seq 3 (SEQ ID NO: 6), encoding 
ORF 3 (SEQ ID NO: 7), and Seq 2 (SEQ ID NO:3), encoding ORF 2 (ORF 2a (SEQ ID 
NO:4), ORF 2b (SEQ ID NO:5))). Seq 3 (SEQ ID NO:6) shows no significant 
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homology to any gene in GenBank. Seq 2 (SEQ ID NO: 3) is 60%, identical to a gene in 
E. coli with no known function. Further DNA sequence analysis revealed a third open 
reading frame (Seq 1 (SEQ ID NO:l), encoding ORF 1 (SEQ ID NO:l)) downstream 
from Seq 2 (SEQ ID NO:3). The nucleotide sequence of Seq 1 (SEQ ID NO: I) is 61% 
5 identical to the ftsZ gene from E. coli, a gene that has been shown to be essential for cell 
division and septum formation (See also, Example 2 below). 

The recent development of a transposon mutagenesis system (Buddingh, 
G.J., et aL (1987), Science, 86:20-21; Clark et al (1987)) and the use of more 
appropriate virulence model systems provide the opportunity to gain new insight into 

10 meningococcal disease. We have identified a transposon mutant, VVV6, that shows a 
decreased ability to invade HEC1-B tissue culture cells compared to the NMB parent 
strain and a well characterized capsule deficient mutant, M7. Since both NMB and 
VVV6 show identical replication rates in vitro, the lower numbers of viable counts 
obtained on the tissue culture assay are most likely due to a diminished ability of VVV6 

15 to invade tissue culture cells. 

The VVV6 strain produces identical lipopoly saccharide and SDS-PAGE 
protein profiles and has similar quantity and type of pili compared to its parental strain. 
These data in conjunction with the fact that capsule analysis on VVV6 did not reveal any 
distinguishable differences compared to the parent strain suggests that the mutation 

20 responsible for the altered phenotype in mutant VVV6 is not likely due to capsule 

deficiency or deficiency in any of the other major surface factors. The decreased ability 
of mutant VVV6 to invade HEC-l-B cells are linked to the disruption of a gene(s) 
encoding for a factor(s) necessary for recognition of a host cell receptor. 

Bacteria and tissue culture cells- N. meningitidis serogroup B strain NMB, and 

25 construction of the Tn£76-derived mutant library are described elsewhere (Buddingh, 

G.J., et al. y Science, 86:20-21). All Neisseria strains were maintained on CHOC-II agar 
(Carr-Scarborough, Atlanta, GA). The human endometrial carcinoma cell line, HEC1-B, 
was maintained by the Biological Products Branch, CDC, Atlanta, GA. Nearly confluent 
monolayers were maintained in minimal essential medium (MEM) with 10% fetal bovine 

30 serum (Gibco). 

Attachment-Invasion Assay. Parent and Tn916 mutant strains were grown from 
frozen stocks on CHOC-II agar plates to late log phase (18 hours) at 37°C in 5%C0 2 . 
The cells were scraped and resuspended in MEM without serum to an OD 600 of 0.5 
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(approximately 10 8 CFU/ml). Monolayers of Hec-l-B cells in 24 well dishes were 
infected using the resuspended cells. This produced a multiplicity of infection (MOI) of 
10:1 (bacteria: host cell). Infection of tissue culture cells was allowed to proceed for 5 
hours in 5% C0 2 at 37°C. After the infection was completed, each well was washed 5 
5 times with MEM to remove most unattached bacterial cells. 

To assay for attachment and invasion, 1 ml of MEM was added to each 
well. The monolayers were scraped, the suspension was diluted lO 3 , and 100^1 of this 
suspension was plated onto CHOCII agar and incubated with the cells at 37<>C overnight. 
To assay for invasion only, 1 ml of MEM containing gentamicin (125 ^ig/ml) was added 
10 to each well after the initial 5 hr assay and incubated at 37°C in 5% C0 2 for 90 min. 
Monolayers were then washed twice with MEM. One- ml of MEM was added to each 
well and the monolayers were scraped and diluted. Fifty-/xl of the suspension were 
plated on CHOCII agar plates. Plates were incubated overnight at 37<>C in 5% CO.. 

Nasopharyngeal organ cultures. Construction of the human nasopharyngeal organ 
15 culture model has been previously described (Stephens, D.S., et al. (1991), Rev Infect 
Dis., 13:22-33). The model uses tissues obtained from children undergoing elective 
adenoidectomy, and allows quantitative and qualitative assessment of the stages of 
attachment and invasion of N. meningitidis to the mucosal surface. This model system 
was used as a secondary screening method to evaluate the attachment and invasion 
20 properties of the mutant(s) identified using the HEC-1B monolayers. Nasopharyngeal 
organ cultures were incubated with meningococci. After 12 hours of incubation the 
organ cultures were washed and the associated bacteria were enumerated by 
homogenization of each organ culture and with subsequent dilution and plating for colony 
counts. 

25 Outer Membrane Protein Assay. Meningococcal outer membrane proteins were 

isolated as described by Clark et al, 1987. This method utilizes differential 
centrifugation followed by precipitation of outer membrane proteins in 2% sarcosyl. The 
samples were resolved on SDS-PAGE and the proteins visualized by Coomassie blue or 
silver stain. 

30 Lipooligosacharide preparation. LOS was prepared by lysis of bacteria in 

distilled water followed by proteinase K digestion as described by Hitchcock et al., 1983. 

Electron microscopy of pili. Negative staining grids of meningococci were 
prepared by fixation in 1 % glutaraldehyde (cacodylate buffer) and staining with 1 % 
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phosphotungstaie, and examination by transmission electron microscopy. 

Southern Analysis. Southern analysis was performed to demonstrate that Tn916 
was inserted in the genomic DNA of mutant VVV6. A digoxigenin-labeled plasmid 
containing transposon Tn916 was used as a probe. Genomic DNA from VVV6 and 
5 NMB was isolated and digested with the appropriate restriction enzymes and transferred 
onto a nylon membrane (Kathariou, S., et al MoL Microbiol, 4:729-735). 
Hybridization was carried out as described in the Genius System manual (Boehringer 
Mannheim Biochemicals). Briefly, blotted membranes were placed in hybridization tubes 
containing 20ml of prehybridization solution (5X SSC, 1% (w/v) blocking reagent, 0.1% 
10 N-laurysarcosine, 0.02% SDS) and incubated in a hybrization oven at 50°C for at least 
lh. The prehybridization solution was replaced with 20ml of hybridization solution 
(prehybridization solution containing the digoxygenin-labeled probe) and incubated over 
night at 52°C. The membrane was washed 2X for 5min with a 2X SSC buffer containing 
0.1% SDS, then washed 3X for 5min with a buffer consisting of 0.5X SSC and 0.1% 
15 SDS. All washes were carried out at room temperature. Colorimetric detection of DNA 
bands was performed as suggested by the manufacturer. 

DNA amplification by PCR. PCR was used to amplify chromosomal DNA 
fragments flanking Tn916. The sequences at the ends of the transposon were previously 
reported (Clewell, D.B., et al. (1988), J. Baa., 170:3046-3052) and were used in the 
20 design specific oligonucleotides that served as anchor primers for PCR amplification of 
adjacent chromosomal DNA. Amplification and isolation of the unknown genomic DNA 
sequences was performed as previously described (Efrain M. Ribot, et al. (1996), Gene. 
Briefly, mutant genomic DNA was isolated and digested with Saw3Al restriction 
endonuclease. This restriction enzyme cuts both arms of the transposon near the 
25 transposon-chromosome junction. After digestion was completed, the samples were 

phenokchloroform-extracted, ethanol-precipitated and vacuum-desiccated using standard 
methodologies described by Sambrook et aL The DNA pellet was then resuspended in 
20ml of TE buffer, 2ml of Saul A linkers (250mM/ml) and 2 fil of 10X T4 DNA ligase 
buffer and T4 DNA ligase (10 units) were added. The ligation reaction was incubated at 
30 room temperature for least 3 hours at. The samples were then phenol: chloroform 
extracted, ethanol precipitated and resuspended in 20ml of TE buffer. 

The ligation mixture is then subjected to unidirectional PCR amplification 
(15 cycles: 95°C;1 min, 52°C;1 min, 72°C;l l / 2 min in 25ml volumes) of the target DNA 
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using 5* biotin-labeled anchor primers specific for the known sequences of the right arm 
and left arm of the transposon. The resulting single-stranded PCR product contained the 
adjacent unknown chromosomal DNA flanked by the remaining portion of the 
transposon and the sequences corresponding to the ligated linker. The biotin-labeled 
5 single-stranded PCR (ssPCR) product containing the flanking chromosomal DNA was 
captured using streptaiving-coated beads as described by the manufacturer (Dynal AS, 
Oslo, Norway). 

The particle-isolated ssPCR products were subjected to 25 cycles of PCR 
amplification (94°C:1 min; 50°C:30 sec; 72°C:1V4 min in 25/xl volumes). Transposon 

10 and linker specific primers were used for this purpose. The resulting PCR fragments 

were cloned or sequenced directly as described by Ribot et ah, manuscript submitted for 
publication. All the oligonucleotide primers used in this study were synthesized by the 
CDC Biotechnology Core Facility. 

DNA sequencing. Automated DNA sequence analysis was performed using both 

15 the Sanger dideoxy method (Amplitaq for sequencing, Perking-Elmer, Foster City, 

California) and the dye terminator reaction method as described in the ABI instruction 
manual. 

Example 2; the Neisseria meningitidis ftsZ Homolog ue 

The nucleotide sequence of a 1.2 kb DNA fragment of Neisseria meningitidis 

20 DNA that contains an open reading frame (Seq 1 (SEQ ID NO:l), encoding ORF 1 (SEQ 
ID NO:2)) that is highly homologous to the corresponding ORF from the Escherichia 
coliftsZ gene is described in this example. The E. coliftsZ gene codes for a GTP- 
binding protein essential for septum formation and cell division. The 1.2 kb N. 
meningitidis ORF 1 is 61% identical, at the nucleotide sequence level, to the ftsZ gene of 

25 £. coli and 50% identical at the amino acid level. The predicted polypeptide contains a 
glycine-rich stretch of seven amino acids that is identical to the highly conserved GTP- 
binding domain found in all the JisZ genes identified thus far. Based on these data, Seq 
1 (SEQ ID NO:l) codes for the N. meningitidis cell division protein FtsZ. 

DNA amplification by PCR. Neisseria meningitidis mutant and wild-type strains 

30 were grown on CHOCII agar (Carr-Scarborough, Atlanta, GA) plates at 37«C in 5%C0 2 
over night. Genomic DNA was isolated using the Isoquick nucleic acid extraction kit 
(ORCA Research Inc., Bothell, WA) under the conditions described by the manufacturer. 
The procedure used for the amplification of chromosomal DNA fragments was based on 
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a method developed for the rapid amplification of transposon ends (RATE). A modified 
version of RATE was used to chromosome walk up- and downstream from the 
transposon insertion site in mutant VVV6. Briefly, genomic DNA was isolated from the 
bacterial strain and 5 fig digested with the desired restriction endonuclease. The 
5 restriction enzyme Hindlll was used. After digestion was completed, the sample was 

phenol: chloroform treated and vacuum using standard methods (Sambrook et aL, 1989). 
The pellet containing the total genomic digest is resuspended in 15/zl of double distilled 
sterile H 2 0 and 2ml of the appropriate linkers (250mM/ml), 10 units of DNA ligase, and 
2.5^1 of 10X T4 DNA ligase buffer added and the sample volume adjusted to 25ml with 

10 double-distilled sterile water. The ligation reaction was then allowed to proceed for at 
least three hours at room temperature. Construction of the HIEC linker was done by 
adding equimolar amounts of each oligonucleotide, HEIC1 
(AGCTTGAGGTCGACGGGATATCG) (SEQ ID NO: 10) and HEIC2 
(AATTCGATATCCCGTCGACCTCA) (SEQ ID NO:l 1), incubating at 90°C for 5min 

15 and allowing the samples to cool slowly to room temperature. Excess linkers are 
removed by passing the samples trough MicroconlOO filters as described by the 
manufacturer (Amicon Inc., Beverly, MA). 

Unidirectional PCR amplification (15 cycles: 95°C;1 min, 52°C;1 min, 
72°C;l ! /i min in 25ml volumes) of the target sequence was performed using a 5' biotin- 

20 labeled primer/reaction (B800F1 CACATAAGGCGTGGTGGAAG (SEQ ID NO: 12)) 
specific for the known genomic sequence obtained from previous sequencing reactions. 
This unidirectional amplification reaction yields single-stranded DNA molecules 
containing the chromosomal target sequence, the adjacent unknown chromosomal DNA, 
and the linker. Streptavidin coated beads (Dynal AS, Oslo, Norway) were used to 

25 capture the PCR-amplified biotin-labeled single-stranded products following the 

manufacturers recommendations. Aliqots of the purified single-stranded PCR products 
were then subjected to 30 cycles of PCR amplification (94°C:1 min; 42°C:30 sec; 
72°C:116 min in 25ml volumes), using a nested primer specific for the for the known 
sequence (800F8 CTCCCAAACCGGACAAACCG (SEQ ID NO: 13)) and a primer 

30 corresponding to the ligated linker (HIEC2). A 5ml aliquot of each of the resulting 

double-stranded PCR products was loaded onto a 0.8% agarose gel to determine product 
size and purity (data not shown). Selected products were then subjected to automated 
DNA sequence analysis using primers specific for both the known genomic (800F9 
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GTCAAGTACGGACTGATTGTCG (SEQ ID NO: 14)) sequence and the HEIC2 linker 
primer. 

DNA sequencing. Automated DNA sequence analysis of PCR amplified 
fragments was performed using the dye terminator reaction method as described in the 
5 ABI-373 instruction manual (Perking-Elmer, Foster City, California). Computer assisted 
analysis was performed using the Wisconsin Sequence Analysis Package (GCG) 
(Madison, Wisconsin) and DNASIS, (National Bioscience, Inc, Plymouth, Minnesota). 

The TnP/6 transposon mutant of N. meningitidis, serogroup B, strain 
NMB, demonstrated a significant decrease in its ability to invade human epithelial tissue 

10 culture cells compared to control strains. Sequencing analysis on VVV6 genomic DNA 
indicated that the transposon insertion occurred between two possible open reading 
frames (Seq 3 (SEQ ID NO: 6) and Seq 2 (SEQ ID NO: 3)) (Figure 1). Further DNA 
sequence analysis on the region downstream from Seq 2 (SEQ ID NO: 3) revealed a 
another ORF (Seq 1 (SEQ ID NO:l)). Nucleotide sequence comparison of this ORF 

15 (Seq 1 (SEQ ID NO:l)) using the FASTA algorithm of the GCG Wisconsin package 

shows that the nucleotide sequence of Seq 1 (SEQ ID NO: 1) is over 61 % identical to the 
E. coli essential cell division gene ftsZ. All ftsZ genes identified to date show a high 
degree of homology. We have also identified both a possible ribosome binding site and 
start codon for this ORF (Seq 1 (SEQ ID NO:l)) and there are two possible stop codons 

20 at nucleotide positions 1100 and 1148. Primer extension and SI nuclease protection 
studies are used to determine the precise location of promoter regions and termination 
sequences of Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3) and Seq 3 (SEQ ID NO:6). 

The amino acid sequence of the ORF 1 polypeptide (SEQ ID NO: 1) is 
50% identical to the FtsZ protein from E. coli and B. subtilis. Furthermore, the amino 

25 acid sequence of the N. meningitidis FtsZ protein contains the highly conserved GTP- 

binding domain present in all the FtsZ proteins identified thus far (de Boer, et aL (1992) 
Nature 359:254-56; Mukherjee, et aL (1993) Proc. Natl. Acad. ScL USA. 90:1053-57; 
Beall, et aL (1988) J. BacterioL 170:4855-4864). 

A highly conserved glycine-rich stretch of amino acids (GGGTGTG (SEQ 

30 ID NO: 15)) has been found in all the FtsZ proteins identified so far (Corton, et aL 
(1987) J. BacterioL 169:1-7; de Boer, et aL (1992) Nature 359:254-56). As can be 
observed from amino acid residues at approximately 109 to 115 of ORF 1 (SEQ ID 
NO:l), the amino acid sequence of the polypeptide encoded by ORF 1 (SEQ ID NO:l) 
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also contains this highly conserved domain. This provides additional evidence that the 
gene product encoded by the Neisseria ORF is the homolog of the FtsZ protein from E. 
coli. In vitro assays indicate that this glycine-rich sequence contains a domain with 
GTP/GDP-binding activity (Corton, et al. (1987) J. BacterioL 169:1-7; de Boer, et aL 
5 (1992) Nature 359:254-56; Mukherjee, et aL (1993) Proc. Natl. Acad. Sci. USA. 

90:1053-57). Escherichia coli cells have been characterized that carry mutations within 
this amino acid stretch that result in a cell division deficient phenotype. The inability of 
such mutants to divide has been linked to reduced GTPase activity (Cook, et al. (1994) 
Mol. Microbiol. 14:485-495; Ricard, et al. (\9Ti)J. BacterioL 116:314-322). It has 

10 been demonstrated that the E. coli functional unit of FtsZ consists of multiple copies of 
FtsZ assembled together in a multimeric complex. It appears that the GTPase activity is 
required for the assembly of such a complex. If a mutated FtsZ has a decreased ability 
to bind GTP, complex formation will not occur as it would under normal conditions, thus 
diminishing the cell's ability to divide. This stretch of amino acids is not only conserved 

15 among the eubacteria (Lutkenhaus, et al. (1980) J. BacterioL 142:615-620; Miyakawa, et 
aL (1972) J. Bacteriol 112:959-958), but is also remarkably similar to the a-, b-, and g- 
tubulins from eukaryotic cells (Gill, et al. (1986) MoL Gen. Genet. 205:134-145). FtsZ 
may be the predecessor of the more evolutionary recent tubulin (Bermudez, et al. 
(1994) Microbiol. Rev. 58:387-400). This hypothesis is supported by the recent 

20 discovery of an ftsZ homolog gene from the archaebacterium Halobacterium salinarum. 

Amino acid sequence aligment of the H. salinarum FtsZ showed remarkable similarity to 
the FtsZ proteins from eubacteria and tubulins from eucaryotic ceils. 

In £. coli, ftsZ is preceded by the fisA gene and followed by the envA 
gene. The nucleotide sequence of a 225bp long segment of DNA upstream of ORF 1 

25 (SEQ ID NO:7) from N. meningitidis, NMB, was obtained, but failed to reveal any 

significant homology to the ftsA gene from E. coli. The DNA sequence downstream of 
the Neisseria ftsZ also revealed no homology to the E. coli envA gene. This is not 
surprising since the DNA regions flanking the ftsZ gene from organisms such as Bacillus 
subtilis (Beall, et aL (1988) J. BacterioL 170:4855-4864), Streptomyces coleicolor 

30 (McCormick, et aL (1994) Mol. Microbiol. 14:243-254), and H. salinarum (Margolin, et 
al. (1996) J. BacterioL 178:1320-1327) do not show the same genetic map observed in 
E. coli. 

While a hypothetical ribosome binding site (RBS) and start codon (ATG) 
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were found, no obvious consensus promoter sequence was identified in association with 
theyfrZ-homolog gene. This ORF may be controlled by a promoter located elsewhere in 
the DNA region upstream; in E. coli f the promoter controlling expression of ftsZ is 
found upstream within thefisA gene. Primer extension analysis ultimately defines the 
5 start site of transcription. In addition, there is no obvious termination sequence at the 
end of the ORF of the JfoZ-homolog, suggesting that the gene is expressed as part of a 
polycistronic message in Neisseria meningitidis. Interestingly, computer analysis 
revealed a strong termination loop at the end of Seq 2 (SEQ ID NO:3); this may indicate 
the end of transcription of the polygenic mRNA. Again, this genetic arrangement bears 

10 a strong resemblance to the ftsZ gene region from E. colt., which consists of an operon- 
like structure containing the ftsQ, ftsA y ftsZ, and envA genes. 
Discussion of the Accompanying Sequence Listing 

SEQ ID NO; 8 provides the sequence of Seq 4. This sequence 
encompasses Seq 1, Seq 2, and Seq 3, which are additionally provided at SEQ ID NO:l, 

15 SEQ ID NO:3, and SEQ ID NO:6, respectively. The information for the nucleic acid 
sequences are presented as DNA sequence information. One of skill will readily 
understand that portions of the sequences also describe RNAs encoded by the sequence 
(e.g, by substitution of T residues with corresponding U residues), and a variety of 
conservatively modified variations, including silent substitutions of the sequences. While 

20 only a single strand of sequence information is shown, one of skill will immediately 

appreciate that the complete corresponding complementary sequence is fully described by 
comparison to the given sequences. 
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SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:7 provide 
the amino acid sequences of ORF 1, ORF 2a, ORF 2b, and ORF 3, respectively. A 
variety of conservatively modified variations of the amino acid sequences provided will 
be apparent to one of skill, and are described herein. One of skill will also recognize 
5 that a variety of nucleic acid sequences encode each of the polypeptides due to the codon 
degeneracy present in the genetic code. Each of the nucleic acids which encodes the 
given polypeptide is described by comparison to the amino acid sequence and translation 
via the genetic code. 

10 The above examples are provided to illustrate the invention but not to limit 

its scope- Other variants of the invention will be readily apparent to one of ordinary 
skill in the art and are encompassed by the appended claims. All publications, patents, 
and patent applications cited herein are hereby incorporated 
by reference for all purposes. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Quinn, Frederick D . 

Ribot, Efrain M. 
Stephens, David S. 
Raymond, Nigel 

(ii) TITLE OF INVENTION: Invasion Associated Genes From Neisseria 
meningitidis Serogroup B 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Townsend and Townsend and Crew LLP 

(B) STREET: Two Embarcadero Center, Eighth Floor 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94111-3834 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Not yet assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Weber, Kenneth A. 

(B) REGISTRATION NUMBER: 31,677 

(C) REFERENCE /DOCKET NUMBER: 17 639-006000US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 576-0200 

(B) TELEFAX: (415) 576-0300 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..1185 

(D) OTHER INFORMATION: /note= "Seq 1 = position 223 through 

position 1407 of Seq 4" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 1102 

(D) OTHER INFORMATION: /product= " ORF 1" 

/note= "ORF 1 CDS = position 238 through 
position 1324 of Seq 4" 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAGCAGGAGT TTTTGA ATG GAA TTT GTT TAC GAC GTG GCA GAA TCG GCA 49 
Met Glu Phe Val Tyr Asp Val Ala Glu Ser Ala 
15 10 

GTC AGC CCT GCG GTG ATT AAA GTA ATC GGC TTG GGC GGC GGC GGT TGC 97 
Val Ser Pro Ala Val He Lys Val He Gly Leu Gly Gly Gly Gly Cys 
15 20 25 

AAT GCA TCC AAT AAC ATG GTT GCC AAC AAT GTG CGC GGT GTG GAG TTT 145 
Asn Ala Ser Asn Asn Met Val Ala Asn Asn Val Arg Gly Val Glu Phe 
30 35 40 

ATC AGT GCC AAT ACG GAT GCG CAG TCT CTG GCA AAA AAC CAT GCG GCG 193 
He Ser Ala Asn Thr Asp Ala Gin Ser Leu Ala Lys Asn His Ala Ala 
45 50 55 

AAG AGA ATC CAG TTG GGT ACG AAT CTG ACA CGC GGT TTG GGC GCG GGC 241 
Lys Arg lie Gin Leu Gly Thr Asn Leu Thr Arg Gly Leu Gly Ala Gly 
60 65 70 ~ 75 

GCG AAT* CCC GAT ATC GGC CGT GCG GCA GCC CAG GAA GAC CGG GAA GCC 289 
Ala Asn Pro Asp He Gly Arg Ala Ala Ala Gin Glu Asp Arg Glu Ala 
80 85 90 

ATT GAA GAA GCC ATT CGC GGT GCG AAT ATG CTG TTT ATC ACG ACC GGT 337 
He Glu Glu Ala He Arg Gly Ala Asn Met Leu Phe He Thr Thr Gly 
95 100 105 

ATG GGC GGC GGT ACC GGT ACC GGT TCC GCG CCG GTT GTT GCT GAG ATT 385 
Met Gly Gly Gly Thr Gly Thr Gly Ser Ala Pro Val Val Ala Glu He 
110 115 120 

GCC AAG TCT TTG GGC ATT CTG ACC GTT GCC GTG GTT ACC CGA CCG TTC 433 
Ala Lys Ser Leu Gly He Leu Thr Val Ala Val Val Thr Arg Pro Phe 
125 130 135 

GCA TAT GAA GGT AAG CGC GTC CAT GTC GCA CAG GCA GGG TTG GAA CAG 481 
Ala Tyr Glu Gly Lys Arg Val His Val Ala Gin Ala Gly Leu Glu Gin 
140 145 150 155 

TTG AAA GAA CAC GTC GAT TCG CTG ATT ATC ATC CCG AAC GAC AAA CTG 529 
Leu Lys Glu His Val Asp Ser Leu He He He Pro Asn Asp Lys Leu 
160 165 170 

ATG ACT GCA TTG GGT GAA GAC GTA ACG ATG CGC GAA GCC TTC CGT GCC 577 
Met Thr Ala Leu Gly Glu Asp Val Thr Met Arg Glu Ala Phe Arg Ala 
175 180 185 

GCC GAC AAT GTA TTG CGC GAT GCG GTC GCA GGC ATT TCC GAA GTG GTA 625 
Ala Asp Asn Val Leu Arg Asp Ala Val Ala Gly He Ser Glu Val Val 
190 195 200 

ACT TGC CCG AGC GAA ATC ATC AAC CTC GAC TTT GCC GAC GTG AAA ACC 673 
Thr Cys Pro Ser Glu He He Asn Leu Asp Phe Ala Asp Val Lys Thr 
205 210 215 

GTG ATG AGC AAC CGC GGT ATC GCT ATG ATG GGT TCG GGT TAT GCC CAA 721 
Val Met Ser Asn Arg Gly He Ala Met Met Gly Ser Gly Tyr Ala Gin 
220 225 230 235 

GGT ATC GAC CGT GCG CGT ATG GCG ACC GAC CAG GCC ATT TCC AGT CCG 769 
Gly He Asp Arg Ala Arg Met Ala Thr Asp Gin Ala He Ser Ser Pro 
240 245 250 
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CTG CTG GAC GAT GTA ACC TTG GAC GGA GCG CGC GGT GTG CTG GTC AAT 817 

Leu Leu Asp Asp Val Thr Leu Asp Gly Ala Arg Gly Val Leu Val Asn 
255 260 265 

ATT ACG ACT GCT CCG GGT TGC TTG AAA ATG TCC GAG TTG TCC GAA GTC 865 
He Thr Thr Ala Pro Gly Cys Leu Lys Met Ser Glu Leu Ser Glu Val 
270 275 280 

ATG AAA ATC GTC AAC CAA AGC GCG CAT CCC GAT TTG GAA TGC AAA TTC 913 
Met Lys He Val Asn Gin Ser Ala His Pro Asp Leu Glu Cys Lys Phe 
285 290 295 

GGT GCT GCT GAA GAC GAG ACC ATG AGC GAA GAT GCC ATC CGG ATT ACC 961 
Gly Ala Ala Glu Asp Glu Thr Met Ser Glu Asp Ala He Arg He Thr 
300 305 310 315 

ATT ATC GCT ACC GGT CTG AAA GAA AAA GGC GCG GTC GAT TTT GTT CCG 1009 
He He Ala Thr Gly Leu Lys Glu Lys Gly Ala Val Asp Phe Val Pro 
320 325 330 

GCA AGG GAG GTA GAA GCG GTT GCC CCG TCC AAA CAG GAG CAA AGC CAC 1057 
Ala Arg Glu Val Glu Ala Val Ala Pro Ser Lys Gin Glu Gin Ser His 
335 340 345 

AAT GTC GAA GGT AGA TCC GCA CCA ATC GCG GTA TCC GCA CGA 1099 
Asn Val Glu Gly Arg Ser Ala Pro He Ala Val Ser Ala Arg 
350 355 360 

TGAACCTTAC CGCTGCGGAT TTCGACAATC AGTCCGTACT TGACGACTTG AAATCCCTGC 1159 

GATTTTGCGT CGTCAACACA ATTCAG 1185 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 361 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Glu Phe Val Tyr Asp Val Ala Glu Ser Ala Val Ser Pro Ala Val 
15 10 15 

He Lys Val He Gly Leu Gly Gly Gly Gly Cys Asn Ala Ser Asn Asn 
20 25 30 

Met Val Ala Asn Asn Val Arg Gly Val Glu Phe He Ser Ala Asn Thr 
35 40 45 

Asp Ala Gin Ser Leu Ala Lys Asn His Ala Ala Lys Arg He Gin Leu 
50 55 60 

Gly Thr Asn Leu Thr Arg Gly Leu Gly Ala Gly Ala Asn Pro Asp He 
65 70 75 80 

Gly Arg Ala Ala Ala Gin Glu Asp Arg Glu Ala He Glu Glu Ala He 
85 90 95 

Arg Gly Ala Asn Met Leu Phe He Thr Thr Gly Met Gly Gly Gly Thr 
100 105 110 

Gly Thr Gly Ser Ala Pro Val Val Ala Glu He Ala Lys Ser Leu Gly 
115 120 125 
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He Leu Thr Val Ala Val Val Thr Arg Pro Phe Ala Tyr Glu Gly Lys 
130 135 140 

Arg Val His Val Ala Gin Ala Gly Leu Glu Gin Leu Lys Glu His Val 
145 150 155 160 

Asp Ser Leu He He He Pro Asn Asp Lys Leu Met Thr Ala Leu Gly 
165 170 175 

Glu Asp Val Thr Met Arg Glu Ala Phe Arg Ala Ala Asp Asn Val Leu 
180 185 " 190 

Arg Asp Ala Val Ala Gly He Ser Glu Val Val Thr Cys Pro Ser Glu 
195 200 205 

He He Asn Leu Asp Phe Ala Asp Val Lys Thr Val Met Ser Asn Arg 
210 215 220 

Gly He Ala Met Met Gly Ser Gly Tyr Ala Gin Gly He Asp Arg Ala 
225 230 235 240 

Arg Met Ala Thr Asp Gin Ala He Ser Ser Pro Leu Leu Asp Asp Val 
245 250 255 

Thr Leu Asp Gly Ala Arg Gly Val Leu Val Asn He Thr Thr Ala Pro 
260 265 270 

Gly Cys Leu Lys Met Ser Glu Leu Ser Glu Val Met Lys He Val Asn 
275 280 285 

Gin Ser Ala His Pro Asp Leu Glu Cys Lys Phe Gly Ala Ala Glu Asp 
290 295 300 

Glu Thr Met Ser Glu Asp Ala He Arg He Thr lie He Ala Thr Gly 
305 310 315 320 

Leu Lys Glu Lys Gly Ala Val Asp Phe Val Pro Ala Arg Glu Val Glu 
325 330 335 

Ala Val Ala Pro Ser Lys Gin Glu Gin Ser His Asn Val Glu Gly Arg 
340 " 345 350 

Ser Ala Pro He Ala Val Ser Ala Arg 
355 360 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



( ix ) FEATURE : 

(A) NAME/KEY: - 

<B) LOCATION: 1..960 

(D) OTHER INFORMATION: /note= "Seq 2 - positions 1921 through 

2880 of Seq 4" 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 39,. 941 

(D) OTHER INFORMATION: /product= " ORF 2a" 

/note- " ORF 2 protein variant using 
alternate start site at position 39 of 
Seq 2 (position 1959 through position 
2861 of Seq 4)" 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 51.. 941 

(D) OTHER INFORMATION: /product = "ORF 2b" 

/note= n ORF 2 protein variant using 
alternate start site at position 51 of 
Seq 2 (position 1971 through position 
2861 of Seq 4)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTTTTTAAAG TCAGGGAAAT GCTGTCAACG CACTGCCT ATG GGT TTG AAA ATG 53 

Met Gly Leu Lys Met 
1 5 

TCG ATT GCT GCC GGT ATC GGT TTG TTT TTG GCA CTG ATT TCC CTG AAA 101 
Ser lie Ala Ala Gly lie Gly Leu Phe Leu Ala Leu lie Ser Leu Lys 
10 15 20 

GGC GCA GGC CAT TAT CGT TGC CAA TCC GGC AAC CTT GGT CGG TTT GGG 149 
Gly Ala Gly His Tyr Arg Cys Gin Ser Gly Asn Leu Gly Arg Phe Gly 
25 30 35 

CGA TAT TCA TCA GCC GTC CGC GTT GTT GGC ACT GTT CGG TTT TGC TAT 197 
Arg Tyr Ser Ser Ala Val Arg Val Val Gly Thr Val Arg Phe Cys Tyr 
40 45 50 

GGT GGT CGT ATT GGG ACA TTT CCG CGT TCA AGG CGC AAC ATC ATC ACC 245 
Gly Gly Arg lie Gly Thr Phe Pro Arg Ser Arg Arg Asn lie lie Thr 
55 60 65 

ATC TTG ACC ATT ACC GTC ATT GCC AGC CTG ATG GGT TTG AAT GAA TTT 293 
He Leu Thr He Thr Val He Ala Ser Leu Met Gly Leu Asn Glu Phe 
70 75 80 85 

CAC GGC ATC ATC GGC GAA GTA CCG AGC ATT GCG CCG ACT TTT ATG CAG 341 
His Gly lie He Gly Glu Val Pro Ser He Ala Pro Thr Phe Met Gin 
90 95 100 

ATG GAT TTT GAA GGC CTG TTT ACC GTC AGC TGG TCA GTG ATT TTC GTC 389 
Met Asp Phe Glu Gly Leu Phe Thr Val Ser Trp Ser Val He Phe Val 
105 110 115 

TTC TTC TTG GTC GAT CTA TTT GAC AGT ACC GGA ACG CTG GTC GGC ATA 437 
Phe Phe Leu Val Asp Leu Phe Asp Ser Thr Gly Thr Leu Val Gly He 
120 125 - 130 

TCC CAC CGT GCC GGG CTG CTG GTG GAC GGT AAG CTG CCC CGC CTG AAA 485 
Ser His Arg Ala Gly Leu Leu Val Asp Gly Lys Leu Pro Arg Leu Lys 
135 140 145 

CGC GCA CTG CTT GCA GAC TCT ACC GCC ATT ATG GCA GGT GCG GCT TTG 533 
Arg Ala Leu Leu Ala Asp Ser Thr Ala He Met Ala Gly Ala Ala Leu 
150 155 160 165 
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GGT ACT TCT TCC ACC ACG CCT TAT GTG GAA AGC GCG GCG GGC GTA TCG 581 

Gly Thr Ser Ser Thr Thr Pro Tyr Val Glu Ser Ala Ala Gly Val Ser 
170 175 180 

GCA GGC GGA CGG ACC GGC CTG ACG GCG GTT ACC GTC GGC GTA TTG ATG 629 
Ala Gly Gly Arg Thr Gly Leu Thr Ala Val Thr Val Gly Val Leu Met 
185 190 195 

CTC GCC TGC CTG ATG TTT TCA CCT TTG GCG AAA AGT GTT CCC GCT TTT 677 
Leu Ala Cys Leu Met Phe Ser Pro Leu Ala Lys Ser Val Pro Ala Phe 
200 205 210 

GGC ACC GCG CCC GCC CTG CTT TAT GTC GGC ACG CAG ATG CTC CGC AGT 725 
Gly Thr Ala Pro Ala Leu Leu Tyr Val Gly Thr Gin Met Leu Arg Ser 
215 220 * 225 

GCG AGG GAT ATT GAT TGG GAC GAT ATG ACG GAA GCC GCA CCC GCA TTC 7 73 

Ala Arg Asp He Asp Trp Asp Asp Met Thr Glu Ala Ala Pro Ala Phe 
230 235 240 245 

CTG ACC ATT GTC TTC ATG CCG TTT ACC TAT TCG ATT GCA GAC GGC ATC 821 
Leu Thr He Val Phe Met Pro Phe Thr Tyr Ser lie Ala Asp Gly He 
250 255 260 

GCC TTC GGC TTC ATC AGC TAT GCC GTG GTT AAA CTT TTA TGC CGC CGC 869 
Ala Phe Gly Phe He Ser Tyr Ala Val Val Lys Leu Leu Cys Arg Arg 
265 270 275 

ACC AAA GAC GTT CCG CCT ATG GAA TGG GTT GTT GCC GTA TTG TGG GCA 917 
Thr Lys Asp Val Pro Pro Met Glu Trp Val Val Ala Val Leu Trp Ala 
280 285 290 

CTG AAA TTC TGG TAT TTG GGC TGATTGATTC GATATTAAAA AT 960 
Leu Lys Phe Trp Tyr Leu Gly 
295 300 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Leu Lys Met Ser He Ala Ala Gly He Gly Leu Phe Leu Ala 
15 10 15 

Leu He Ser Leu Lys Gly Ala Gly His Tyr Arg Cys Gin Ser Gly Asn 
20 25 30 

Leu Gly Arg Phe Gly Arg Tyr Ser Ser Ala Val Arg Val Val Gly Thr 
35 40 45 

Val Arg Phe Cys Tyr Gly Gly Arg He Gly Thr Phe Pro Arg Ser Arg 
50 55 60 

Arg Asn He He Thr He Leu Thr He Thr Val He Ala Ser Leu Met 
65 70 75 80 

Gly Leu Asn Glu Phe His Gly He He Gly Glu Val Pro Ser He Ala 
85 90 95 
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Pro Thr Phe Met Gin Met Asp Phe Glu Gly Leu Phe Thr Val Ser Trv> 
100 105 110 

Ser Val He Phe Val Phe Phe Leu Val Asp Leu Phe Asp Ser Thr Gly 
115 120 125 

Thr Leu Val Gly He Ser His Arg Ala Gly Leu Leu Val Asp Gly Lys 
130 135 140 

Leu Pro Arg Leu Lys Arg Ala Leu Leu Ala Asp Ser Thr Ala lie Met 
145 150 155 160 

Ala Gly Ala Ala Leu Gly Thr Ser Ser Thr Thr Pro Tyr Val Glu Ser 
165 170 175 

Ala Ala Gly Val Ser Ala Gly Gly Arg Thr Gly Leu Thr Ala Val Thr 
180 185 190 

Val Gly Val Leu Met Leu Ala Cys Leu Met Phe Ser Pro Leu Ala Lys 
195 200 205 

Ser Val Pro Ala Phe Gly Thr Ala Pro Ala Leu Leu Tyr Val Gly Thr 
210 215 220 

Gin Met Leu Arg Ser Ala Arg Asp He Asp Trp Asp Asp Met Thr Glu 
225 230 235 240 

Ala Ala Pro Ala Phe Leu Thr He Val Phe Met Pro Phe Thr Tyr Ser 
245 250 255 

He Ala Asp Gly He Ala Phe Gly Phe He Ser Tyr Ala Val Val Lys 
260 265 270 

Leu Leu Cys Arg Arg Thr Lys Asp Val Pro Pro Met Glu Trp Val Val 
275 - " 280 285 

Ala Val Leu Trp Ala Leu Lys Phe Trp Tyr Leu Gly 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 : 

Met Ser He Ala Ala Gly He Gly Leu Phe Leu Ala Leu He Ser Leu 
15 10 15 

Lys Gly Ala Gly His Tyr Arg Cys Gin Ser Gly Asn Leu Gly Arg Phe 
20 ' 25 30 

Gly Arg Tyr Ser Ser Ala Val Arg Val Val Gly Thr Val Arg Phe Cys 
35 40 45 

Tyr Gly Gly Arg He Gly Thr Phe Pro Arg Ser Arg Arg Asn He He 
50 55 60 

Thr He Leu Thr He Thr Val He Ala Ser Leu Met Gly Leu Asn Glu 
65 70 75 80 
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Phe His Gly lie lie Gly Glu VaX Pro Ser lie Ala Pro Thr Phe Met 
85 90 95 

Gin Met Asp Phe Glu Gly Leu Phe Thr Val Ser Trp Ser Val lie Phe 
100 105 110 

Val Phe Phe Leu Val Asp Leu Phe Asp Ser Thr Gly Thr Leu Val Gly 
115 120 125 

lie Ser His Arg Ala Gly Leu Leu Val Asp Gly Lys Leu Pro Arg Leu 
130 135 ' 140 

Lys Arg Ala Leu Leu Ala Asp Ser Thr Ala lie Met Ala Gly Ala Ala 
145 150 155 160 

Leu Gly Thr Ser Ser Thr Thr Pro Tyr Val Glu Ser Ala Ala Gly Val 
165 170 175 

Ser Ala Gly Gly Arg Thr Gly Leu Thr Ala Val Thr Val Gly Val Leu 
180 185 190 

Met Leu Ala Cys Leu Met Phe Ser Pro Leu Ala Lys Ser Val Pro Ala 
195 200 205 

Phe Gly Thr Ala Pro Ala Leu Leu Tyr Val Gly Thr Gin Met Leu Arg 
210 215 220 

Ser Ala Arg Asp lie Asp Trp Asp Asp Met Thr Glu Ala Ala Pro Ala 
225 230 235 240 

Phe Leu Thr lie Val Phe Met Pro Phe Thr Tyr Ser lie Ala Asp Gly 
245 250 255 

lie Ala Phe Gly Phe lie Ser Tyr Ala Val Val Lys Leu Leu Cys Arg 
260 265 270 

Arg Thr Lys Asp Val Pro Pro Met Glu Trp Val Val Ala Val Leu Trp 
275 280 285 

Ala Leu Lys Phe Trp Tyr Leu Gly 
290 295 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1.-457 

(D) OTHER INFORMATION: /note- "Seq 3 = position 3381 through 

position 3837 of Seq 4" 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 457 

(D) OTHER INFORMATION: /product= "ORF 3" 

/note= "ORF 3 CDS = position 3397 through 
position 3837 of Seq 4 M 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TAATGATTGG ATTGGG ATG CCC GAC GCG TCG GAT GGC TGT GTT TTG CCG 49 
Met Pro Asp Ala Ser Asp Gly Cys Val Leu Pro 

TCC GAA TGT GAT GGA AGC CTG TCC ATA CTG AAA AAA AGT CTA TAN AGG 97 
Ser Glu Cys Asp Gly Ser Leu Ser lie Leu Lys Lys Ser Leu Xaa Arg 
15 20 ' 25 

AGA AAT ATG ATG AGT CAA CAC TCT GCC GGA GCA CGT TTC CGC CAA GCC 145 
Arg Asn Met Met Ser Gin His Ser Ala Gly Ala Arg Phe Arg Gin Ala 
30 35 40 

GTG AAA GAA TCG AAT CCG CTT GCC GTC GCC GGT TGC GTC AAT GCT TAT 193 
Val Lys Glu Ser Asn Pro Leu Ala Val Ala Gly Cys Val Asn Ala Tyr 
45 50 55 

TTT GCA CGA TTG GCC ACC CAA AGC GGT TTC AAA GCC ATC TAT CTG TCT 241 
Phe Ala Arg Leu Ala Thr Gin Ser Gly Phe Lys Ala lie Tyr Leu Ser 
60 65 70 75 

GGC GGC. GGC GTG GCA GCC TGT TCT TGC GGT ATC CCT GAT TTG GGC ATT 289 
Gly Gly Gly Val Ala Ala Cys Ser Cys Gly He Pro Asp Leu Gly He 
80 85 90 

ACC ACA ATG GAA GAT GTG CTG ATC GAC GCA CGA CGC ATT ACG GAC AAC 33 7 

Thr Thr Met Glu Asp Val Leu He Asp Ala Arg Arg He Thr Asp Asn 
95 100 * 105 

GTG GAT NCG CCT CTG CTG GTG GAC ATC GAT GTG GGT TGG GGC GGT GCA 385 
Val Asp Xaa Pro Leu Leu Val Asp He Asp Val Gly Trp Gly Gly Ala 
110 115 120 

TTC AAT ATT GCC CGT ACC ATT CGC AAC TTT GAA CGC GCC GGT GTT GCA 433 
Phe Asn He Ala Arg Thr He Arg Asn Phe Glu Arg Ala Gly Val Ala 
125 130 135 

GCG GTT CAC ATC GAA GAT CAG GTA 457 
Ala Val His He Glu Asp Gin Val 
140 145 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Pro Asp Ala Ser Asp Gly Cys Val Leu Pro Ser Glu Cys Asp Gly 
1 5 10 15 

Ser Leu Ser He Leu Lys Lys Ser Leu Xaa Arg Arg Asn Met Met Ser 
20 25 30 

Gin His Ser Ala Gly Ala Arg Phe Arg Gin Ala Val Lys Glu Ser Asn 
35 40 45 

Pro Leu Ala Val Ala Gly Cys Val Asn Ala Tyr Phe Ala Arg Leu Ala 
50 55 ** 60 
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Thr Gin Ser Gly Phe Lys Ala lie Tyr Leu Ser Gly Gly Gly Val Ala 
65 70 75 80 

Ala Cys ser Cys Gly lie Pro Asp Leu Gly He Thr Thr Met Glu Asp 
85 90 95 

Val Leu He Asp Ala Arg Arg He Thr Asp Asn Val Asp Xaa Pro Leu 
100 105 110 

Leu Val Asp He Asp Val Gly Trp Gly Gly Ala Phe Asn He Ala Arg 
115 120 125 

Thr He Arg Asn Phe Glu Arg Ala Gly Val Ala Ala Val His He Glu 
130 135 140 

Asp Gin Val 
145 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5416 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



( ix ) FEATURE : 

(A) NAME/KEY: - 

(B) LOCATION: 1..5416 

(D) OTHER INFORMATION: /note= "Seq 4 contains Seq 1 (positions 

223-1407), Seq 2 (positions 1921-2880) 
and Seq 3 (positions 3381-3837)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



TGCAGGCATG 


CAAGCTGGAA 


GGAAACTTGC 


CGCAGCCAGG 


AAAACGGTGC 


AGTGCAAGAG 


60 


AGGGAAGGGG 


GCGGCGGTTT 


GTTGGCAAGA 


TTGAAACGGT 


GGATTGAAAA 


CAGCTTCTGA 


120 


ACAGGTGGAT 


TGCCGTTTGA 


CAGGTGAGAA 


GTATTTTGCC 


AGCAGCAAGA 


TACTTCTTAT 


180 


ATAATGAATA 


ATAATTTATT 


TAAACCGTCC 


TCTGAATGGG 


GCGAGCAGGA 


GTTTTTGAAT 


240 


GGAATTTGTT 


TACGACGTGG 


CAGAATCGGC 


AGTCAGCCCT 


GCGGTGATTA 


AAGTAATCGG 


300 


CTTGGGCGGC 


GGCGGTTGCA 


ATGCATCCAA 


TAACATGGTT 


GCCAACAATG 


TGCGCGGTGT 


360 


GGAGTTTATC 


AGTGCCAATA 


CGGATGCGCA 


GTCTCTGGCA 


AAAAACCATG 


CGGCGAAGAG 


420 


AATCCAGTTG 


GGTACGAATC 


TGACACGCGG 


TTTGGGCGCG 


GGCGCGAATC 


CCGATATCGG 


480 


CCGTGCGGCA 


GCCCAGGAAG 


ACCGGGAAGC 


CATTGAAGAA 


G C C ATT CG CG 


GTGCGAATAT 


540 


GCTGTTTATC 


ACGACCGGTA 


TGGGCGGCGG 


TACCGGTACC 


GGTTCCGCGC 


CGGTTGTTGC 


600 


TGAGATTGCC 


AAGTCTTTGG 


GCATTCTGAC 


CGTTGCCGTG 


GTTACCCGAC 


CGTTCGCATA 


660 


TGAAGGTAAG 


CGCGTCCATG 


TCGCACAGGC 


AGGGTTGGAA 


CAGTTGAAAG 


AACACGTCGA 


720 


TTCGCTGATT 


ATCATCCCGA 


ACGACAAACT 


GATGACTGCA 


TTGGGTGAAG 


ACGTAACGAT 


780 


GCGCGAAGCC 


TTCCGTGCCG 


CCG AC AATGT 


ATTGCGCGAT 


GCGGTCGCAG 


GCATTTCCGA 


840 
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AGTGGTAACT TGCCCGAGCG AAATCATCAA CCTCGACTTT GCCGACGTGA AAACCGTGAT 900 

GAGCAACCGC GGTATCGCTA TGATGGGTTC GGGTTATGCC CAAGGTATCG ACCGTGCGCG 960 

TATGGCGACC GACCAGGCCA TTTCCAGTCC GCTGCTGGAC GATGTAACCT TGGACGGAGC 1020 

GCGCGGTGTG CTGGTCAATA TTACGACTGC TCCGGGTTGC TTGAAAATGT CCGAGTTGTC 1080 

CGAAGTCATG AAAATCGTCA ACCAAAGCGC GCATCCCGAT TTGGAATGCA AATTCGGTGC 1140 

TGCTGAAGAC GAGACCATGA GCGAAGATGC CATCCGGATT ACCATTATCG CTACCGGTCT 1200 

GAAAGAAAAA GGCGCGGTCG ATTTTGTTCC GGCAAGGGAG GTAGAAGCGG TTGCCCCGTC 1260 

CAAACAGGAG CAAAGCCACA ATGTCGAAGG TAGATCCGCA CCAATCGCGG TATCCGCACG 1320 

ATGAACCTTA CCGCTGCGGA TTTCGACAAT CAGTCCGTAC TTGACGACTT GAAATCCCTG 1380 

CGATTTTGCG TCGTCAACAC AATTCAGACA AATAATGTGC TGTTTGCCCG TAAACCTGCT 1440 

GCCTCCCGAA TCGGTTTGTC CGGTTTGGGA GGTATGTTTT TCAAGATGTT GCAATTTCGT 1500 

ACGGTTTGCG GTCGGCGGAT TCAGATTTTT CCACTTGATA CAGACTTTCA GATATGGACA 15 60 

CTTCAAAACA AACACTGTTG GACGGGATTT TTAAGCTGAA GGCAAACGGT ACGACGGTGC 1620 

GTACCGAGTT GATGGCGGGT TTGACAACTT TTTTGACGAT GTGCTACATC GTTAATCGTC 1680 

AACC CTCTG A TTTTGGGCGA GACCGGCATG GATATGGGGG CGGTATTCGT CGCTACCTGT 1740 

ATCG CGTCTG CCAATCGGCT GTTTTGTTAT GGGTTTTGTC GGCAACTATC CGATTGCACT 1800 

CGCACCGGGG ATGGGGCTGA ATGCCTATTT CACCTTTGCC GTCGTTAAGG GTATGGGCTG 1860 

CCTTGGCAGG TTGCGTTGGG TGCGGTGTTC ATCTCCGGTC TG AT TT T CAT CCTGTTCAGC 1920 

TTTTTTAAAG TCAGGGAAAT GCTGTCAACG CACTGCCTAT GGGTTTGAAA ATGTCGATTG 1980 

CTGCCGGTAT CGGTTTGTTT TTGGCACTGA TTTCCCTGAA AGGCGCAGGC CAT T ATCG TT 2040 

GCCAATCCGG CAACCTTGGT CGGTTTGGGC GATATTCATC AGCCGTCCGC GTTGTTGGCA 2100 

CTGTTCGGTT TTGCTATGGT GGTCGTATTG GGACATTTCC GCGTTCAAGG CGCAACATCA 2160 

TCACCATCTT GACCATTACC GTCATTGCCA GCCTGATGGG TTTGAATGAA TTTCACGGCA 2220 

TCATCGGCGA AGTACCGAGC ATTGCGCCGA CTTTTATGCA GATGGATTTT GAAGGCCTGT 2280 

TTACCGTCAG CTGGTCAGTG ATTTTCGTCT TCTTCTTGGT CGATCTATTT GACAGTACCG 2340 

GAACGCTGGT CGGCATATCC CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 2400 

TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTATGGC AGGTGCGGCT TTGGGTACTT 2460 

CTTCCACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC GGCAGGCGGA CGGACCGGCC 2520 

TGACGGCGGT TACCGTCGGC GTATTGATGC TCGCCTGCCT GATGTTTTCA CCTTTGGCGA 2580 

AAAGTGTTCC CGCTTTTGGC ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC 2640 

GCAGTGCGAG GGATATTGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 2700 

TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC TTCATCAGCT 2760 

ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT TCCGCCTATG GAATGGGTTG 2820 

TTGCCGTATT GTGGGCACTG AAATTCTGGT ATTTGGGCTG ATTGATTCGA TATTAAAAAT 2880 
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GCCGTCTGAA 


AGGTTTTCAG 


ACGGCATTTT 


GTTTGCCGAT 


ATATTAATTT 


TTATTAAATT 


2940 


ATATAAAAAT 


CAAATACATA 


ATAAAATACA 


TCGGATTGCT 


TAAAAATAAT 


ACATTGTTTT 


3000 


TTATGTATAA 


AATATTTTAT 


AAGTTTTCAG 


GATTTGGATT 


ATTGAAAATT 


TTTCTTGATT 


3060 


TCCTGACAAT 


TTTATTGAAA 


CAAATAATTC 


AAAATTAATC 


TAGTTTAATC 


ATAGAATTAA 


3120 


AATAAAATAT 


TAAAATTATG 


TAATGAGTCT 


CCTTAAAAAT 


GTTTGACATT 


TTCAGTCTTG 


3180 


TGTTTTAGAT 


TATCGAAAAA 


TAAAACTACA 


TAACACTACA 


AAGGAATATT 


ACTATGAAAC 


3240 


CAATTCAGAT 


GTTTTCCCCT 


TTTCTGAATA 


ATCCCCTTGT 


TTTCTTCTTG 


TCTGCGGTTT 


3300 


TGCCGCATAA 


TTCCGAACGG 


TCTGCTGTTT 


TTCTTTGATT 


CGTTTTAAAT 


ATCAATAAGA 


3360 


TAATTTTTCC 


CATATATTTT 


TAATGATTGG 


ATTGGGATGC 


CCGACGCGTC 


GGATGGCTGT 


3420 


GTTTTGCCGT 


CCGAATGTGA 


TGGAAGCCTG 


TCCATACTGA 


AAAAAAGTCT 


ATANAGGAGA 


3480 


AATATGATGA 


GTCAACACTC 


TGCCGGAGCA 


CGTTTCCGCC 


AAGCCGTGAA 


AGAATCGAAT 


3540 


CCGCTTCCCG 


TCGCCGGTTG 


CGTCAATGCT 


TATTTTGCAC 


GATTGGCCAC 


CCAAAGCGGT 


3600 


TTCAAAGCCA 


TCTATCTGTC 


TGGCGGCGGC 


GTGGCAGCCT 


GTTCTTG CGG 


TATCCCTGAT 


3660 


TTGGGCATTA 


C C AC AATGG A 


AGATGTGCTG 


ATCGACGCAC 


GACGCATTAC 


GGAGAACGTG 


3720 


GATNCGCCTC 


TGCTGGTGGA 


CATCGATGTG 


GGTTGGGGCG 


GTGCATTCAA 


TATTGCCCGT 


3780 


ACCATTCGCA 


ACTTTGAACG 


CGCCGGTGTT 


GCAGCGGTTC 


ACATCGAAGA 


TCAGGTAGCG 


3840 


CAAAAACGCT 


GCGGTCACCG 


TCCGAACAAA 


GCCATTGTTA 


TCTNAAGATG 


NAATGGTCGA 


3900 


CCGTAT C AAA 


GCTGCCGTAG 


ATGCGCGCGT 


TGNTGNGAAC 


TTCGTGATTA 


TGGCGCGTAC 


3960 


CGATGCGCTG 


GCGGTAGAAG 


GTTTGGATGC 


CGCTATCGAA 


CGCGCCCAAG 


CTTGTGTCGA 


4020 


AAGCCGGTGC 


GGACATGATT 


TTCCCTGAAG 


CCATGACCGA 


TTTGAACATG 


TACCGCCAAT 


4080 


TTGCAGATGC 


GGTGAAAGTG 


CGTGTTGGCG 


AACATTACCG 


AGTTTGGTTC 


CACTCCGCTT 


4140 


TATACCCAAA 


GCGAGCTGGC 


TGAAAACGGC 


GTGTCGCTGG 


TGCTGTATCC 


GCTGTCATCG 


4200 


TTCCGTGCAG 


CAAGCAAAGC 


CGCTCTGAAT 


GTTTACGAAG 


CGATTATGCG 


CGATGGCACT 


4260 


CAGGCGGCGG 


TGGTGGACAG 


TATGCAAACC 


CGTGCCGAGC 


TGTACGAGCA 


TCTGAACTAT 


4320 


CATGCCTTCG 


AGCAAAAACT 


GGATAAATTG 


TTT CAAAAAT 


GATTTACCGC 


TTTCAGACGG 


4380 


TCTTTCAACA 


AATCCGCATC 


GGTCGTCTGA 


AAACCCGAAA 


CCCATAAAAA 


CACAAAGGAG 


4440 


AAATACCATG 


ACTGAAACTA 


CTCAAACCCC 


GACCTTCAAA 


CCTAAGAAAT 


CCGTTGCGCT 


4500 


TTCAGGCGTT 


GCGGCCGGTA 


ATACCGCTTT 


GTGTACCGTT 


GGCCGCACCC 


GGCAACGATT 


4560 


TGCAGCTATC 


GCGGTTACGA 


CATCTTGGAT 


TTGGGCACAA 


AAATGCG TTT 


GAAGAAGTAG 


4620 


CCCACCTGCT 


GATTCACGGT 


CATCTGCCCA 


ACAAATTCGA 


CGTGGAAGCT 


TATAAAAGGA 


4680 


AGCTCAAATC 


CATGCGCGGC 


CTGCCTATCC 


GTGTATTAAA 


GTTTTGGGAA 


AGCCTGCCTG 


4740 


CACATACCCA 


TCCGGATGGA 


CGGTAATGGC 


GTACCGGCGG 


TATCCATGCT 


GGGCTGCGTT 


4800 


CATCCCGAAC 


GTGAAAGCCA 


TCCCGGAAAG 


TGAAGCGCGC 


GACATCGCCG 


ACAAACTGAT 


4860 


TGCAGCCTCG 


GAGCCTCCTG 


CTGTACTNGG 


TATCAATATC 


GCACAACGGC 


AAACG C ATTG 


4920 
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AGTTGAAGCG 


ACGAGAGACA 


TCGGCGGTCA 


TTTCCTGCAA 


CTGTTNCACG 


GCAACGCCCA 


4980 


AGCGATCACA 


CATCAAAGCC 


ATGCACGTTT 


CACTGATTCT 


GTATGCGAAC 


ACGAGTTCAA 


5040 


CGTTCTACCT 


TTACCGTTTG 


CCGTTCTTCT 


GGTCGGTTCT 


AGCCCTGTAA 


AAAGAGAAGG 


5100 


TTGTTAGCTG 


GCGAAGGTTT 


GCAGCCGTTA 


CAGTTTCCCG 


CGTTATAGCG 


GCCAAGAAAC 


5160 


GAGTTTGGCG 


CACGGTGAGA 


ATTACCTGTT 


GCAACGCCCC 


AGCCTTTACC 


ATATGTGGGC 


5220 


CTACTGGCTT 


NGGCTAGTGC 


TAAGAAACGC 


GGCTATGCTA 


GCGCCTACAT 


GCCGAGTGAC 


5280 


GAGCGTNACG 


CCATCGCAAA 


ACTTATACGC 


ATTTCGGGAA 


GCCAANCGCT 


GGCGGCACAA 


5340 


AG CCTGG AT A 


GTTGTGCGGC 


TAACGNGGCC 


ATTACGACCT 


CATGTATAGT 


CCTCTGACAT 


5400 


GGCGCTANTT 


GCGCCC 










5416 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
- (A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 



( ix ) FEATURE : 

(A) NAME /KEY : misc_RNA 

(B) LOCATION: 1..16 

(D) OTHER INFORMATION: /note= "consensus target sequence for 

hairpin ribozyme*' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
NNNSNGUCNN NNNNNN 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. .23 

(D) OTHER INFORMATION: /note= "oligonucleotide HEICl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGCTTGAGGT CGACGGGATA TCG 



23 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. .23 

(D) OTHER INFORMATION: /note= "oligonucleotide HEIC2 " 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AATTCGATAT CCCGTCGACC TCA 23 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /note= "primer B800F1" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CACATAAGGC GTGGTGGAAG 20 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. .20 

(D) OTHER INFORMATION: /note= "target sequence for 

primer 800F8" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 : 
CTCCCAAACC GGACAAACCG 



20 



WO 98/17805 



PCT/US97/19424 



75 



(2) INFORMATION FOR SEQ ID NO: 14: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "target sequence for 

primer 800F9 M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTCAAGTACG GACTGATTGT CG 22 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

( B ) TYPE; amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Gly Gly Thr Gly Thr Gly 
1 5 
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WHAT IS CLAIMED IS: 



1 1. An isolated nucleic acid encoding a polypeptide selected from the 

2 group of polypeptides consisting of ORF 1 (SEQ ID NO: 1), ORF 2a (SEQ ID NO:4), ORF 

3 2b (SEQ ID NO:5), and ORF 3 (SEQ ID NO:6); and, conservatively modified variations 

4 thereof. 

1 2. The nucleic acid of claim 1, wherein the nucleic acid is selected from 

2 the group consisting of Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), and Seq 3 (SEQ ID 

3 NO:6). 

1 3. An isolated nucleic acid which hybridizes under stringent conditions 

2 to a nucleic acid selected from the group consisting of Seq 1 (SEQ ID NO: 1), Seq 2 (SEQ 

3 ID NO:3), and Seq 3 (SEQ ID NO:6). 

1 4. The isolated nucleic acid of claim 3 wherein the nucleic acid is 

2 between about 20 and about 25 nucleotides in length. 

3 5. The nucleic acid of claim 3, wherein the nucleic acid is selected from 

4 the group consisting of Seq 1 (SEQ ID NO:l), Seq 2 (SEQ ID NO:3), and Seq 3 (SEQ ID 

5 NO:6). 

1 6. An isolated nucleic acid which hybridizes under stringent conditions 

2 to Seq 4 (SEQ ID NO:8), 

1 1. The nucleic acid of claim 6, wherein the nucleic acid is selected from 

2 the group consisting of Seq 1 (SEQ ID NO: 1), Seq 2 (SEQ ID NO:3), and Seq 3 (SEQ ID 

3 NO:6). 

1 8. The nucleic acid of claim 6, wherein the nucleic acid further comprises 

2 a recombinant expression cassette, wherein a nucleic acid subsequence of the expression 

3 cassette hybridizes under stringent conditions to Seq 4 (SEQ ID NO: 8) and wherein the 

4 nucleic acid subsequence is operably linked to a promoter. 
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5 9. The nucleic acid of claim 8, wherein the nucleic acid, when transduced 

6 into a cell, expresses a polypeptide selected from the group of polypeptides consisting of 

7 ORF 1 (SEQ ID NO:2), ORF 2a (SEQ ID NO:4), ORF 2b (SEQ ID NO:5), and ORF 3 

8 (SEQ ID NO:7). 

1 10. A polypeptide encoded by the nucleic acid of claim 3. 

2 11. The polypeptide of claim 10, wherein the polypeptide is selected from 

3 the group consisting of ORF 1 (SEQ ID NO:2), ORF 2a (SEQ ID NO:4), ORF 2b (SEQ ID 

4 NO:5), and ORF 3 (SEQ ID NO:7). 

1 12. An antibody which specifically binds to the polypeptide of claim 10, 

2 wherein the antibody does not bind E coll FtsZ polypeptide, and does not bind to E coli 

3 UNK polypeptide. 

1 13. The antibody of claim 12, wherein the antibody specifically binds to 

2 a polypeptide selected from the group consisting of ORF 1 (SEQ ID NO: 2), ORF 2a (SEQ 

3 ID NO:4), ORF 2b (SEQ ID NO:5), and ORF 3 (SEQ ID NO:7). 

1 14. An immunogenic composition comprising an antigenic epitope derived 

2 from the polypeptide of claim 10. 

1 15. The immunogenic composition of claim 14, wherein the antigenic 

2 epitope is a polypeptide of claim 10. 

1 16. The immunogenic composition of claim 14, wherein the antigenic 

2 epitope is incorporated into a fusion polypeptide. 

1 17. The immunogenic composition of claim 14, wherein the antigenic 

2 epitope is expressed on the surface of a viral vector. 

1 18. The immunogenic composition of claim 14, wherein the immunogenic 

2 composition further comprises an adjuvant. 
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3 19, The immunogenic composition of claim 14, wherein the immunogenic 

4 composition, when administered to a mammal, provides an immune response against the 

5 antigenic epitope. 

1 20. The immunogenic composition of claim 19, wherein administration of 

2 the immunogenic composition inhibits invasion of the cells of the mammal by Neisseria 

3 meningitidis. 

1 21. A amplification reaction mixture comprising: 

2 a template nucleic acid which hybridizes to Seq 4 (SEQ ID NO: 8) under stringent 

3 conditions; and, 

4 a primer pair which hybridizes the template nucleic acid, wherein the primers, when 

5 hybridized to the template, are competent to initiate primer extension by a polymerase. 

1 22, The amplification mixture of claim 21, wherein the amplification 

2 reaction mixture is a PCR reaction mixture, and wherein the polymerase is a thermostable 

3 polymerase. 

1 23. An isolated Neisseria meningitidis diplococcus which has a reduced 

2 ability to invade tissue culture epithelial cells in vitro as compared to a wild-type Neisseria 

3 meningitidis diplococcus, wherein the genome of the isolated Neisseria meningitidis 

4 diplococcus comprises a mutation in the region of the genome corresponding to Seq 4 (SEQ 

5 ID NO: 8). 

1 24. The isolated Neisseria meningitidis diplococcus of claim 23, wherein 

2 the diplococcus comprises a transposon insertion in the region of the genome corresponding 

3 to Seq 4 (SEQ ID NO:8). 

1 25. A method of detecting a Neisseria meningitidis nucleic acid in a 

2 biological sample, wherein the method comprises contacting a probe nucleic acid to the 

3 sample and detecting binding of the nucleic acid to the Neisseria meningitidis nucleic acid, 

4 wherein the probe nucleic acid hybridizes to Seq 4 (SEQ ID NO: 8). 
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1 26. The method of claim 25, wherein 

2 the probe nucleic acid is an amplification primer; 

3 the method further comprises contacting a second amplification primer to the sample 

4 and making an amplicon by amplifying the Neisseria meningitidis nucleic acid; and, 

5 the step of detecting binding of the nucleic acid to the Neisseria meningitidis nucleic 

6 acid comprises detecting the amplicon. 

1 27, The method of claim 25, wherein binding of the probe nucleic acid to 

2 the Neisseria meningitidis nucleic acid is detected using an assay selected from the group of 

3 assays consisting of a Southern assay and a northern assay. 

4 28. A method of inhibiting invasion of a mammalian cell by Neisseria 

5 meningitidis comprising expressing in the mammalian cell an antisense RNA molecule which 

6 binds to the nucleic acid of claim 3. 

1 29. The method of claim 28, wherein the antisense RNA molecule is a 

2 ribozyme. 
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TGCAGGCATGCAAGCTGGAAGGAAACTTGCCGCAGCCAGGAAAACGGTGC 

AGTGCAAGAGAGGGAAGGGGGCGGCGGTTTGTTGGCAAGATTGAAACGGT 

GGATTGAAAACAGCTTCTGAACAGGTGGATTGCCGTTTGACAGGTGAGAAG 

TATTTTGC C AGC AGC AAGATAC TTC TTATATAATGAATAATAATTTATTTAA 

RBS ORF1 
ACCGTCCTCTGAATGGGGCGAGCASGAGTTTTTGAATGGAATTTGTTTACGA 

CGTGGCAGAATCCGGCAGTCCGACCTGCGGTGATTAAAGTAATCCGGCTTGG 

GCGGCCGGCGGTTGGCAATGCAATCAATAACATGGTTGCCAACAATGTGCGC 

GGTGTGGAGTTTATCAGTGCCAATAACGGATGCGCAGTCTCTGGCAAAAAAC 

CATGCGGCGAAGAGAATCCAGTTGGTTACGAATCTGACACGCGGTTTGGGCG 

CGGCGNAATTCCCGATATCGGCCGTGCCGGAGCCCAGGAAGACGGGAAGCCA 

TTGAGAAGAAGCATTCGCGGTGCGAATTTGCTGTTTATCACGACCGGTATGG 

GCGGCGGTACCGGTACCGGTTCCGCGCCGTTGTTGCTGAGATTGCAAGTC 

GGGCATCTGACCGTTGCCGTGGTTACCCGACCGTTCGCATTTGAAGGGTAAT 

GCCGCGTCCAGGTCGCACAGCCAGGTTGGACAGTTGAAGAACACGTCGATTC 

GCTGATTATCATCCCGAACGACAAACTGATGACTGCATGGGTGAAGACGTAA 

CGATGCGCGACGCTTCCGTGCCGCCGACAATGTTTGCGCGATGCGGTCGAGG 

CATTCCGGAAGTGGTAACTTGCCGAGCGAAATCATCCAACCTCGACTTTTGCC 

GACGTGAAAACCGTGATGAGCAACCGCGGTATCGCTATGATGGGTTCGGGTT 

ATGCCCAAGGTATCCGACCGTGCGCGTATGGCGACCXjACCAGGCCATTTCCA 

GTCCGCTGCTGGACGATGTAACCTTGGACGGAGCGCGCGGTGTGCTGGTCAA 

TATTACGACTGCTCCGGGTTGCTTGAAAATGTCCGAGTTGTCCGAAGTCATGA 

FIGURE 4A 
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AAATCGTCAACCAAAGCGCGCATCCCGATTTGGAATGCAAATTCGGTGCTGCT 

GAAGACGAGACCATGAGCGAAGATGCCATCCGGATTACCATTATCGCTACCGG 

TCTGAAAGAAAAAGGCGCGGTCGATTTTGTTCCGGCAAGGGAGGTAGAAGCGG 

TTGCCCCGTCCAAACAGGAGCAAAGCCACAATGTCGAAGGTATGATCCGCACCA 

ATCGCGGTATCCGCACGATGAACCTTACCGCTGCGGATTTCGACAATCAGTCCGT 

END OF ORF1 

ACTTGACGACTTTGAAATCCTGCGATTTTGCGTCGTCAACACAATTCAGACAAA 
TAATGTGCTGTTTGCCCGTAAACCTGCTGCCTCCCGAATCGGTTTGTCCGGTTTGG 
GAGGTATGTTTTTCAAGATGTTGCAATTTCGTACGGTTTGCGGTCGGCGGATTCAG 
ATTTTTCCACTTGATACAGACTTTCAGATATGGACACTTCAAAACAAACACTGTTG 
GAC GGG ATTTTT AAGC TG AAGGC AAACGGTAC GACGGTGCGTAC CGAGTTG ATGG 
CGGGTTTGACAACTTTTTTGACGATGTGCTACATCGTTAATCGTCAACCCTCTGATT 
TTGGGCGAGACCGGCATGGATATGGGGGCGGTATTCGTCGCTACCTGTATCGCGTC 
TGCCAATCGGCTGTTTTGTTATGGGTTTTGTCGGCAAC 
CGGGGATGGGGCTGAATGCCTATTTCACCTTTGCCGTCGTTAAGGGTATC 
CTTGGCAGGTTGCGTTGGGTGCGGTGTTCATCTCCGGTCTGATTTTCATCCCTGTTCA 

*ORF2 **ORF2 

GCTTTTTTAAAGTCAGGGAAATGCTGTCAACGCACTGCCTATGGGTTTGAAAATGTC 

GATTGCTGCCGGTATCGGTTTGTTTTTGGCACTGATTTCCCTGAAA 

TTATCGTTGCCAATCCGGCAACCTTGGTCGGTTTGGGCGATATTCATCAGCCGTCCG 

CGTTGTTGGCACTGTTCGGTTTTCGTATGGTGGTCGTATTGGGACATTTCCGCGTTC 

AAGGCGCAACATCATCACCATCTTGACCATTACCGTCATTGCCAGCCTGATGGGTTT 

GAATGAATTTCACGGCATCATCGGCGAAGTACCGAGCATTGCGCCGACTTTTATGCA 

GATGGATTTTGAAGGCCTGTTTACCGTCAGCTGGTCAGTGATTTTCGTCTTCTTCTTG 

GTCGATCTATTTGACAGTACCGGAACGCTGGTCGKjC^TATCCCACCGTGCCGGGCTG 

FIGURE 4B 
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CTGGTGGACGGTAAGCTGCCCCGCCTGAAACGCGCACTGCTTGCAGACTCTACCGCC 
ATTATGGCAGGTGCGGCTTTGGGTACTTCTTCCACCACGCCTTATGTGGAAAGCGC 
GGCGGGCGTATCGGCAGGCGGACGGACCGGCCTGACGGCGGTTACCGTCGGCGTA 
TTGATGCTCGCCTGCCTGATGTTTTCACCTTTGGCGAAAAGTGTTC 
ACCGCGCCCGCCCTGCTTTATGTCGGCACGCAGATGCTCCGCAGTGCGAGGGATAT 
TGATTGGGACGATATGACGGAAGCCGCACCCGCATTCCTGACCATTGTCTTCATGCC 
GTTTACCTATTCGATTGCAGACGGCATCGCCTTCGGCTTCATCAGCTATGCCGTGGT 
TAAACTTTTATGCCGCCGCACCAAAGACGTTCCGCCTATGGAATGGGTTGTTGCCGT 

END RF ORF2 

ATTGTGGGCACTGAAATTCTGGTATTTGGGCTGATTGATTCGATATTAAAAATGCCG 
TCTGAAAGGTTTTCAGACGGCATTTTGTTTGCCGATATATTAATTTTTATTAAATTAT 
ATAAAAATCAAATACATAATAAAATACATCGGATTGCTTAAAAATAATACATTGTTT 
TTTATGTATAAAATATTTTATAA 

ATTTCC TGAC AATTTTATTGAAAC AAATAATTC AAAATTAATC T AGTTT AATC ATAG A 
ATTAAAATAAAATATTAAAATTATGTAATGAGTC 

AGTCTTGTGTTTTAGATTATCGAAAAATAAAACTACATAACACTACAAAGGAATATT 
ACTATGAAACCAATTCAGATGTTTTCCCCTTTTCTGAATAATCCCCTTGTTTTCTTC 
GTCTGCGGTTTTGCCGCATAATTCCGAACGGTCTGCTGTTTTTCTTTGATTCGTTTTAA 

RBS ORF3 

ATATCAATAAGATAATTTTTCCCATATATTTTTAATGATTGGATTGGGATGCCCGACG 

CGTCGGATGGCTGTGTTTTGCCGTCCGAATGTGATGGAAGCCTGTCCATACTGAA^ 

AAAGTCTAT AAAGG AG AAATATG ATG AG TC AACACTC TGC CGGAGC ACGTTTCC GCC 

AAGCCGTGAAAGAATCGAATCCGCTTGCCGTCGCCGGTTGCGTCAATGCTTATTTTG 

CACGATTGGCCACCCAAAGCGGTTTCAAAGCCATCTATCTGTCTGGCGGCGGCGTGG 

CAGCCTGTTCTTGCGGTATCCCTGATTTGGGCATTACCACAATGGAAGATGTGCTGAT 

FIGURE 4C 
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CGACGCACGACGCATTACGGACAACGTGGATNCGCCTCTGCTGGTGGACATCGATGT 
GGGTTGGGGCGGTGCATTCAATATTGCCCGTACCATTCGCAACTTTGAACGCGCCGG 



END OF ORF3 

TGTTGCAGCGGTTCACATCGAAGATCAGGTAGCGCAAAAACGCTGCGGTCACCGTCC 
GAACAAAGCCATTGTTATCTNAAGATGNAATGGTCGACCGTATCAAAGCTGCCGTAG 
ATGCGCGCGTTGNTGNGAACTTCGTG^ 

AAGGTTTGGATGCCGCTATCGAACGCGCCCAAGCTTGTGTCGAAAGCCGGTGCGGAC 
ATGATTTTCCCTGAAGCCATGACCGATTTGAACATGTACCGCCAATTTGCAGATGCG 
GTGAAAGTGCGTGTTGGCGAACATTACCGAGTTTGGTTCCACTCCGCTTTATACCCAA 
AGCGAGCTGGCTGAAAACGGCGTGTCGCTGGTGCTGTATCCGCTGTCATCGTTCCGT 
GCAGCAAGCAAAGCCGCTCTGAATGTTTACGAAGCGATTATGCGCGATGGCACTCAG 
GCGGCGGTGGTGGACAGTATGCAAACCCGTGCCGAGCTGTACGAGCATCTGAACTAT 
CATGCCTTCGAGCAAAAACTGGATAAATTGTTTCAAAAATGATTTACCGCTTTCAGAC 
GGTCTTTCAACAAATCCGCATCGGTCGTCTGAAAACCCGAAACCCATAAAAACACAA 
AGGAGAAATACCATGACTGAAACTACTCAAACCCCGACCTTCAAACCTAAGAAATCC 
GTTGCGCTTTCAGGCGTTGCGGCCGGTAATACCGCTTTGTGTACCGTTGGCCGCACCC 
GGCAACGATTTGGAGCTATCGCGGTTACGACATCTTGGATTTGGGCACAAAAATGCG 
TTTGAAGAAGTAGCCCACCTGCTGATTCACGGTCATCTGCCCAACAAATTCGACGTG 
GAAGC T TATAAAAGGAAGC TC AAATCC ATGCGCGGCCTGC C TATCCGTGT ATTAAAG 
TTTTGGGAAAGCCTGCCTGCACATACCCATCCGGATGGACGGTAATGGCGTACCGGC 
GGTATCCATGCTGGGCTGCGTTCATCCCGAACGTGAAAGCCATCCCGGAAAGTGAAG 
CGCGCGACATCGCCGACAAACTGATTGCAGCCTCGGAGCCTCCTGCTGTACTNGGTAT 
CAATATCGCACAACGGCAAACGCATTGAGTTGAAGCGACGAGAGACATCGGCGGTC 

FIGURE 4D 
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ATTTCCTGCAACTGTTNCACGGCAACGCCCAAGCGATCACACATCAAAGCCATGCAC 

GTTTCACTGATTCTGTATGCGAACACGAGTTCAACGTTCTACCTTTACCGTTTGCCGT 

TCTTCTGGTCGGTTCTAGCCCTGTAAAAAGAGAAGGTTGTTAGCTGGCGAAGGTTTG 

CAGCCGTTACAGTTTCCCGCGTTATAGCGGCCAAGAAACGAGTTTGGCGCACGGTGA 

GAATTACCTGTTGCAACGCCCCAGCCTTTACCATATGTGGGCCTACTGGCTTNGGCTA 

GTGCTAAGAAACGCGGCTATGCTAGCGCCTACATGCCGAGTGACGAGCGTNACGCCA 

TCGCAAAACTTATACGCATTTCGGGAAGCCAANCGCTGGCGGCT^CAAAGCCTGGATA 

GTTGTGCGGCTAACGNGGCCATTACGACCTCATGTATAGTCCTCTGACATGGCGCTA 

NTTGCGCCC 

FIGURE 4E 
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11 20 29 38 * 47 ** 56 

5' TTT TAA AGT CAG GGA AAT GCT GTC AAC GCA CTG CCT ATG GGT TTG AAA ATG TCG 
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551 560 569 578 587 596 

TCC ACC ACG CCT TAT GTG GAA AGC GCG GCG GGC GTA TCG GCA GGC GGA CGG ACC 

STTPYVESAAGVSAGGRT 

605 614 623 632 641 650 

GGC CTG ACG GCG GTT ACC GTC GGC GTA TTG ATG CTC GCC TGC CTG ATG TTT TCA 

GLTAVTVGVLMLAC LMF S 

659 668 677 686 695 704 

CCT TTG GCG AAA AGT GTT CCC GCT TTT GGC ACC GCG CCC GCC CTG CTT TAT TGC 

PLAKSVPAFGTAPALLYV 

713 722 731 740 749 758 

GGC ACG CAG ATG CTC CGC AGT. GCG AGG GAT ATT GAT TGG GAC GAT ATG ACG GAA 

GTQMLRSARD IDWDDMTE 
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GCC GCA CCC GCA TTC CTG ACC ATT GTC TTC ATG CCG TTT ACC TAT TCG ATT GCA 
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DGIAFGFISYAVVKLLCR 
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929 938 947 956 

AAA TTC TGG TAT TTG GGC TGA TTG ATT CGA TAT TAA AAA T 3 ' 
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NDWIG MPDASDGCVLPSE 

64 73 82 91 100 109 

TGT GAT GGA AGC CTG TCC ATA CTG AAA AAA AGT CTA TAN AGG AGA AAT ATG ATG 



CDGSLSILKKSLXRRNMM 

118 127 136 145 154 163 

AGT CAA CAC TCT GCC GGA GCA CGT TTC CGC CAA GCC GTG AAA GAA TCG AAT CCG 



S QHSAGARFRQAVKESNP 

172 181 190 199 208 217 

CTT GCC GTC GCC GGT TGC GTC AAT GCT TAT TTT GCA CGA TTG GCC ACC CAA AGC 
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ATT ACG GAC AAC GTG GAT NCG CCT CTG CTG GTG GAC ATC GAT GTG GGT TGG GGC 
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GGT GCA TTC AAT ATT GCC CGT ACC ATT CGC AAC TTT GAA CGC GCC GGT GTT GCA 
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10 20 30 40 50 

1 T 50 

1 T 50 

1 _ _ „ 50 

1 TGCAGGCATG CAAGCTGGAA GGAAACTTGC CGCAGCCAGG AAAACGGTGC 50 

60 70 80 90 100 

51 10Q 

51 1Q0 

51 10Q 

51 AGTGCAAGAG AGGGAAGGGG GCGGCGGTTT GTTGGCAAGA TTGAAACGGT 100 

110 120 130 140 150 

ioi 150 

101 

ioi „.„ 150 

101 GGATTGAAAA CAGCTTCTGA ACAGGTGGAT TGCCGTTTGA CAGGTGAGAA 150 

160 170 180 190 200 

151 200 

151 2Q0 

151 200 

151 GTATTTTGCC AGCAGCAAGA TACTTCTTAT ATAATGAATA ATAATTTATT 200 

210 220 230 240 250 

201 250 

201 250 

201 — GAGCAGGA GTTTTTGAAT GGAATTTGTT 250 

2 01 TAAACCGTCC TCTGAATGGG GCGAGCAGGA GTTTTTGAAT GGAATTTGTT 250 

260 270 280 290 300 

251 300 

251 3 00 

251 TACGACGTGG CAGAATCGGC AGTCAGCCCT GCGGTGATTA AAGTAATCGG 300 

251 TACGACGTGG CAGAATCGGC AGTCAGCCCT GCGGTGATTA AAGTAATCGG 300 

310 320 330 340 350 

301 350 

301 3 50 

301 CTTGGGCGGC GGCGGTTGCA ATGCATCCAA TAACATGGTT GCCAACAATG 350 

3 01 CTTGGGCGGC GGCGGTTGCA ATGCATCCAA TAACATGGTT GCCAACAATG 350 

360 370 380 390 400 

351 400 

351 400 

351 TGCGCGGTGT GGAGTTTATC AGTGC CAATA CGGATGCGCA GTCTCTGGCA 400 

351 TGCGCGGTGT GGAGTTTATC AGTGC CAATA CGGATGCGCA GTCTCTGGCA 400 

410 420 430 440 450 

401 450 

401 450 

401 AAAAACCATG CGGCGAAGAG AATCCAGTTG GGTACGAATC TGACACGCGG 450 

401 AAAAACCATG CGGCGAAGAG AATCCAGTTG GGTACGAATC TGACACGCGG 450 
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460 470 480 490 500 

ORF3 451 500 

ORF2.SEQ 451 500 

451 TTTGGGCGCG GGCGCGAATC CCGATATCGG CCGTGCGGCA GCCCAGGAAG 500 

PATENT . SEQ 4 51 TTTGGGCGCG GGCGCGAATC CCGATATCGG CCGTGCGGCA GCCCAGGAAG 500 

510 520 530 540 550 

ORF3 501 550 

ORF2.SEQ 501 550 

ORF1 501 ACCGGGAAGC CATTGAAGAA GCCATTCGCG GTGCGAATAT GCTGTTTATC 55 0 

PATENT. SEQ 501 ACCGGGAAGC CATTGAAGAA GCCATTCGCG GTGCGAATAT GCTGTTTATC 550 

560 570 580 590 600 

ORF3 551 600 

ORF2.SEQ 551 600 

ORF1 551 ACGACCGGTA TGGGCGGCGG TACCGGTACC GGTTCCGCGC CGGTTGTTGC 600 

PATENT « SEQ 551 ACGACCGGTA TGGGCGGCGG TACCGGTACC GGTTCCGCGC CGGTTGTTGC 600 

610 620 630 640 650 

ORF3 601 650 

ORF2.SEQ 601 650 

ORF1 601 TGAGATTGCC AAGTCTTTGG GCATTCTGAC CGTTGCCGTG GTTACCCGAC 650 

PATENT. SEQ 601 TGAGATTGCC AAGTCTTTGG GCATTCTGAC CGTTGCCGTG GTTACCCGAC 650 

660 670 680 690 700 

ORF3 651 700 

ORF2.SEQ 651 700 

ORF1 651 CGTTCGCATA TGAAGGTAAG CGCGTCCATG TCGCACAGGC AGGGTTGGAA 700 

PATENT . SEQ 651 CGTTCGCATA TGAAGGTAAG CGCGTCCATG TCGCACAGGC AGGGTTGGAA 700 

710 720 730 740 750 

ORF3 701 75 0 

ORF2.SEQ 701 750 

ORF1 701 CAGTTGAAAG AACACGTCGA TTCGCTGATT ATCATCCCGA ACGACAAACT 750 

PATENT. SEQ 701 CAGTTGAAAG AACACGTCGA TTCGCTGATT ATCATCCCGA ACGACAAACT 750 

760 770 780 790 800 

ORF3 751 800 

ORF2.SEQ 751 80 0 

ORF1 751 GATGACTGCA TTGGGTGAAG ACGTAACGAT GCGCGAAGCC TTCCGTGCCG 800 

PATENT. SEQ 7 51 GATGACTGCA TTGGGTGAAG ACGTAACGAT GCGCGAAGCC TTCCGTGCCG 800 

810 820 830 840 850 

ORF3 801 850 

ORF2.SEQ 801 850 

ORF1 801 CCGACAATGT ATTGCGCGAT GCGGTCGCAG GCATTTCCGA AGTGGTAACT 850 

PATENT. SEQ 801 CCGACAATGT ATTGCGCGAT GCGGTCGCAG GCATTTCCGA AGTGGTAACT 850 

860 870 880 890 900 

ORF3 851 900 

ORF2 . SEQ 851 900 

ORF1 851 TGCCCGAGCG AAATCATCAA CCTCGACTTT GCCGACGTGA AAACCGTGAT 900 

PATENT . SEQ 851 TGCCCGAGCG AAATCATCAA CCTCGACTTT GCCGACGTGA AAACCGTGAT 900 

910 920 930 940 950 

ORF3 901 950 

ORF2.SEQ 901 950 

ORF1 901 GAGCAACCGC GGTATCGCTA TGATGGGTTC GGGTTATGCC CAAGGTATCG 950 

PATENT. SEQ 901 GAGCAACCGC GGTATCGCTA TGATGGGTTC GGGTTATGCC CAAGGTATCG 950 
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960 970 980 990 1000 

ORF3 951 10 00 

ORF2.SEQ 951 1000 

ORF1 951 ACCGTGCGCG TATGGCGACC GACCAGGCCA TTTCCAGTCC GCTGCTGGAC 1000 

PATENT. SEQ 9 51 ACCGTGCGCG TATGGCGACC GACCAGGCCA TTTCCAGTCC GCTGCTGGAC 1000 

1010 1020 1030 1040 1050 

ORF3 1001 1050 

ORF2.SEQ 1001 10 5Q 

ORF1 1001 GATGTAACCT TGGACGGAGC GCGCGGTGTG CTGGTCAATA TTACGACTGC 1050 

PATENT. SEQ 1001 GATGTAACCT TGGACGGAGC GCGCGGTGTG CTGGTCAATA TTACGACTGC 1050 

1060 1070 1080 1090 1100 

ORF3 1051 1100 

ORF2.SEQ 1051 1100 

ORF1 1051 TCCGGGTTGC TTGAAAATGT CCGAGTTGTC CGAAGTCATG AAAATCGTCA 1100 

PATENT. SEQ 1051 TCCGGGTTGC TTGAAAATGT CCGAGTTGTC CGAAGTCATG AAAATCGTCA 1100 

1110 1120 1130 1140 1150 

ORF3 1101 1150 

ORF2.SEQ 1101 1150 

ORF1 1101 ACCAAAGCGC GCATCCCGAT TTGGAATGCA AATTCGGTGC TGCTGAAGAC 1150 

PATENT. SEQ 1101 ACCAAAGCGC GCATCCCGAT TTGGAATGCA AATTCGGTGC TGCTGAAGAC 1150 

1160 1170 1180 1190 1200 

ORF3 1151 1200 

ORF2.SEQ 1151 1200 

ORF1 1151 GAGACCATGA GCGAAGATGC CATCCGGATT ACCATTATCG CTACCGGTCT 1200 

PATENT. SEQ 1151 GAGACCATGA GCGAAGATGC CATCCGGATT ACCATTATCG CTACCGGTCT 1200 

1210 1220 1230 1240 1250 

ORF3 1201 1250 

ORF2.SEQ 1201 1250 

ORF1 1201 GAAAGAAAAA GGCGCGGTCG ATTTTGTTCC GGCAAGGGAG GTAGAAGCGG 1250 

PATENT. SEQ 12 01 GAAAGAAAAA GGCGCGGTCG ATTTTGTTCC GGCAAGGGAG GTAGAAGCGG 1250 

1260 1270 1280 1290 1300 

ORF3 . 1251 1300 

ORF2.SEQ 1251 1300 

ORF1 1251 TTGCCCCGTC CAAACAGGAG CAAAGCCACA ATGTCGAAGG TAGATCCGCA 1300 

PATENT. SEQ 12 51 TTGCCCCGTC CAAACAGGAG CAAAGCCACA ATGTCGAAGG TAGATCCGCA 1300 

1310 1320 1330 1340 1350 

ORF3 1301 1350 

ORF2.SEQ 1301 1350 

ORF1 13 01 CCAATCGCGG TATCCGCACG ATGAACCTTA CCGCTGCGGA TTTCGACAAT 1350 

PATENT. SEQ 13 01 CCAATCGCGG TATCCGCACG ATGAACCTTA CCGCTGCGGA TTTCGACAAT 1350 

1360 1370 1380 1390 1400 

ORF3 1351 1400 

ORF2.SEQ 1351 1400 

ORF1 1351 CAGTCCGTAC TTGACGACTT GAAATCCCTG CGATTTTGCG TCGTCAACAC 1400 

PATENT. SEQ 1351 CAGTCCGTAC TTGACGACTT GAAATCCCTG CGATTTTGCG TCGTCAACAC 1400 
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1410 1420 1430 1440 1450 

ORF3 1401 1450 

ORF2.SEQ 1401 1450 

ORF1 1401 AATTCAG 1450 

PATENT. SEQ 1401 AATTCAGACA AATAATGTGC TGTTTGCCCG TAAACCTGCT GCCTCCCGAA 1450 

1460 1470 1480 1490 1500 

ORF3 1451 1500 

ORF2.SEQ 1451 1500 

ORF1 1451 1500 

PATENT. SEQ 1451 TCGGTTTGTC CGGTTTGGGA GGTATGTTTT TCAAGATGTT GCAATTTCGT 1500 

1510 1520 1530 1540 1550 

ORF3 1501 1550 

ORF2.SEQ 1501 1550 

ORF1 1501 1550 

PATENT. SEQ 1501 ACGGTTTGCG GTCGGCGGAT TCAGATTTTT CCACTTGATA CAGACTTTCA 1550 

1560 1570 1580 1590 1600 

ORF3 1551 1600 

ORF2.SEQ 1551 1600 

ORF1 1551 1600 

PATENT. SEQ 15 51 GATATGGACA CTTCAAAACA AACACTGTTG GACGGGATTT TTAAGCTGAA 1600 

1610 1620 1630 1640 1650 

ORF3 1601 1650 

ORF2.SEQ 1601 1 65 q 

ORF1 1601 1650 

PATENT . SEQ 1601 GGCAAACGGT ACGACGGTGC GTACCGAGTT GATGGCGGGT TTGACAACTT 1650 

1660 1670 1680 1690 1700 

ORF3 1651 1700 

ORF2.SEQ 1651 1700 

ORF1 1651 1700 

PATENT. SEQ 1651 TTTTGACGAT GTGCTACATC GTTAATCGTC AACCCTCTGA TTTTGGGCGA 1700 

1710 1720 1730 1740 1750 

ORF3 1701 1750 

ORF2.SEQ 1701 1750 

ORF1 1701 1750 

PATENT. SEQ 1701 GACCGGCATG GATATGGGGG CGGTATTCGT CGC TACCTGT ATCGCGTCTG 1750 

1760 1770 1780 1790 1800 

ORF3 1751 1800 

ORF2.SEQ 1751 1800 

ORF1 1751 1800 

PATENT. SEQ 1751 CCAATCGGCT GTTTTGTTAT GGGTTTTGTC GGCAACTATC CGATTGCACT 1800 

1810 1820 1830 1840 1850 

ORF3 1801 1850 

ORF2.SEQ 1801 1850 

ORF1 1801 1850 

PATENT. SEQ 1801 CGCACCGGGG ATGGGGCTGA ATGCCTATTT CACCTTTGCC GTCGTTAAGG 1850 

1860 1870 1880 1890 1900 

ORF3 1851 1900 

ORF2.SEQ 1851 1900 

ORF1 1851 1900 

PATENT. SEQ 1851 GTATGGGCTG CCTTGGCAGG TTGCGTTGGG TGCGGTGTTC ATCTCCGGTC 1900 
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1910 1920 1930 1940 1950 

ORF3 1901 X950 

0RF2.SEQ 1901 -TTTTTAAAG TCAGGGAAAT GCTGTCAACG 1950 

ORF1 1901 19 50 

PATENT. SEQ 19 01 TGATTTTCAT CCTGTTCAGC TTTTTTAAAG TCAGGGAAAT GCTGTCAACG 1950 

1960 1970 1980 1990 2000 

ORF3 1951 2000 

ORF2.SEQ 1951 CACTGCCTAT GGGTTTGAAA ATGTCGATTG CTGCCGGTAT CGGTTTGTTT 2000 

ORF1 1951 2000 

PATENT. SEQ 1951 CACTGCCTAT GGGTTTGAAA ATGTCGATTG CTGCCGGTAT CGGTTTGTTT 2000 

2010 2020 2030 2040 2050 

ORF3 2001 2050 

ORF2.SEQ 2001 TTGGCACTGA TTTCCCTGAA AGGCGCAGGC CATTATCGTT GCCAATCCGG 2050 

ORF1 2001 2050 

PATENT. SEQ 2001 TTGGCACTGA TTTCCCTGAA AGGCGCAGGC CATTATCGTT GCCAATCCGG 2050 

2060 2070 2080 2090 2100 

ORF3 2051 2100 

ORF2.SEQ 2051 CAACC TTGGT CGGTTTGGGC GATATTCATC AGCCGTCCGC GTTGTTGGCA 2100 

ORF1 2051 2100 

PATENT. SEQ 2051 CAACCTTGGT CGGTTTGGGC GATATTCATC AGCCGTCCGC GTTGTTGGCA 2100 

2110 2120 2130 2140 2150 

ORF3 2101 2150 

ORF2.SEQ 2101 CTGTTCGGTT TTGCTATGGT GGTCGTATTG GGACATTTCC GCGTTCAAGG 2150 

ORF1 2101 2150 

PATENT . SEQ 2101 CTGTTCGGTT TTGCTATGGT GGTCGTATTG GGACATTTCC GCGTTCAAGG 2150 

2160 2170 2180 2190 2200 

ORF3 2151 2200 

ORF2.SEQ 2151 CGCAACATCA TCACCATCTT GACCATTACC GTCATTGCCA GCCTGATGGG 2200 

ORF1 2151 2200 

PATENT. SEQ 2151 CGCAACATCA TCACCATCTT GACCATTACC GTCATTGCCA GCCTGATGGG 2200 

2210 2220 2230 2240 2250 

ORF3 2201 2250 

ORF2.SEQ 2201 TTTGAATGAA TTTCACGGCA TCATCGGCGA AGTACCGAGC ATTGCGCCGA 2250 

ORF1 2201 2250 

PATENT. SEQ 2201 TTTGAATGAA TTTCACGGCA TCATCGGCGA AGTACCGAGC ATTGCGCCGA 2250 

2260 2270 2280 2290 2300 

ORF3 2251 2300 

ORF2.SEQ 22 51 CTTTTATGCA GATGGATTTT GAAGGCCTGT TTACCGTCAG CTGGTCAGTG 2300 

ORF1 2251 2300 

PATENT. SEQ 22 51 CTTTTATGCA GATGGATTTT GAAGGCCTGT TTACCGTCAG CTGGTCAGTG 2300 

2310 2320 2330 2340 2350 

ORF3 2301 2350 

ORF2.SEQ 2301 ATTTTCGTCT TCTTCTTGGT CGATCTATTT GACAGTACCG GAACGCTGGT 2350 

ORF1 2301 2350 

PATENT. SEQ 23 01 ATTTTCGTCT TCTTCTTGGT CGATCTATTT GACAGTACCG GAACGCTGGT 2350 
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2360 2370 2380 2390 2400 

ORF3 2351 2400 

ORF2.SEQ 2 3 51 CGGCATATCC CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 2400 

ORF1 2351 2400 

PATENT. SEQ 23 51 CGGCATATCC CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 2400 

2410 2420 2430 2440 2450 

ORF3 2401 -— 2450 

ORF2.SEQ 2401 TGAAACGCGC AC TGCTTGC A GACTCTACCG CCATTATGGC AGGTGCGGCT 2450 

ORF1 2401 2450 

PATENT. SEQ 2 401 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTATGGC AGGTGCGGCT 2450 

2460 2470 2480 2490 2500 

ORF3 2451 2500 

ORF2.SEQ 2451 TTGGGTACTT CTTCCACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 2500 

ORF1 2451 2500 

PATENT. SEQ 2451 TTGGGTACTT CTTCCACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 2500 

2510 2520 2530 2540 2550 

ORF3 2501 2550 

ORF2.SEQ 2 501 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 2550 

ORF1 2501 2550 

PATENT. SEQ 2 501 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 2550 

2560 2570 2580 2590 2600 

ORF3 2551 2600 

ORF2.SEQ 2551 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGGC 2600 

ORF1 2551 2600 

PATENT. SEQ 2551 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGGC 2600 

2610 2620 2630 2640 2650 

ORF3 2601 2650 

ORF2.SEQ 2601 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 2650 

ORF1 2601 2650 

PATENT. SEQ 2 601 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 2 650 

2660 2670 2680 2690 2700 

ORF3 2651 2700 

ORF2.SEQ 26 51 GGATATTGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 2700 

ORF1 2651 2700 

PATENT. SEQ 2651 GGATATTGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 2700 

2710 2720 2730 2740 2750 

ORF3 2701 2750 

ORF2.SEQ 2701 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 2750 

ORF1 2701 2750 

PATENT. SEQ 2701 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 2750 

2760 2770 2780 2790 2800 

ORF3 2751 2800 

ORF2.SEQ 2751 TTCATCAGCT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 2800 

ORF1 2751 . 2800 

PATENT. SEQ 27 51 TTCATCAGCT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 2800 

2810 2820 2830 2840 2850 

ORF3 2801 2850 

ORF2.SEQ 2801 TCCGCCTATG GAATGGGTTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 2850 

ORF1 2801 2850 

PATENT. SEQ 2801 TCCGCCTATG GAATGGGTTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 2 850 
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2860 2870 2880 2890 2900 

ORF3 2851 2900 

ORF2.SEQ 2851 ATTTGGGCTG ATTGATTCGA TATTAAAAAT 2900 

ORF1 2851 2900 

PATENT. SEQ 2 851 ATTTGGGCTG ATTGATTCGA TATTAAAAAT GCCGTC TGAA AGGTTTTCAG 2 900 

2910 2920 2930 2940 2950 

ORF3 2901 2950 

ORF2.SEQ 2901 2950 

ORF1 2901 2950 

PATENT. SEQ 2 901 ACGGCATTTT GTTTGCCGAT ATATTAATTT TTATTAAATT ATATAAAAAT 2950 

2960 2970 2980 2990 3000 

ORF3 2951 3000 

ORF2.SEQ 2951 3000 

ORF1 2951 3000 

PATENT. SEQ 2951 CAAATACATA ATAAAATACA TCGGATTGCT TAAAAATAAT ACATTGTTTT 3000 

3010 3020 3030 3040 3050 

ORF3 3001 3050 

ORF2.SEQ 3001 3050 

ORF1 3001 3050 

PATENT. SEQ 3001 TTATGTATAA AATATTTTAT AAGTTTTCAG GATTTGGATT ATTGAAAATT 3050 

3060 3070 3080 3090 3100 

ORF3 3051 3100 

ORF2.SEQ 3051 3100 

ORF1 3051 3100 

PATENT. SEQ 3051 TTTCTTGATT TCCTGACAAT TTTATTGAAA CAAATAATTC AAAATTAATC 3100 

3110 3120 3130 3140 3150 

ORF3 3101 3150 

ORF2.SEQ 3101 3150 

ORF1 3101 3150 

PATENT . SEQ 3101 TAGTTTAATC ATAGAATTAA AATAAAATAT TAAAATTATG TAATGAGTCT 3150 

3160 3170 3180 3190 3200 

ORF3 3151 3200 

ORF2.SEQ 3151 3200 

ORF1 3151 3200 

PATENT. SEQ 3151 CCTTAAAAAT GTTTGACATT TTCAGTCTTG TGTTTTAGAT TATCGAAAAA 3200 

3210 3220 3230 3240 3250 

ORF3 3201 3250 

ORF2.SEQ 3201 3250 

ORF1 3201 3250 

PATENT . SEQ 3201 TAAAACTACA TAACACTACA AAGGAATATT ACTATGAAAC CAATTCAGAT 3250 

3260 3270 3280 3290 3300 

ORF3 3251 3300 

ORF2.SEQ 3251 3300 

ORF1 3251 — . 3300 

PATENT. SEQ 32 51 GTTTTCCCCT TTTCTGAATA ATCCCCTTGT TTTCTTCTTG TCTGCGGTTT 33 00 

3310 3320 3330 3340 3350 

ORF3 3301 3350 

ORF2.SEQ 3301 3350 

ORF1 3301 3350 

PATENT. SEQ 3301 TGCCGCATAA TTCCGAACGG TCTGCTGTTT TTCTTTGATT CGTTTTAAAT 3350 
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3360 3370 3380 3390 3400 

ORF3 3351 -AATGATTGG ATTGGGATGC 3400 

ORF2.SEQ 3351 3400 

ORF1 3351 3400 

PATENT. SEQ 3351 ATCAATAAGA TAATTTTTCC CATATATTTT TAATGATTGG ATTGGGATGC 3400 

3410 3420 3430 3440 3450 

ORF3 3401 CCGACGCGTC GGATGGCTGT GTTTTGCCGT CCGAATGTGA TGGAAGCCTG 3450 

ORF2.SEQ 3401 3450 

ORF1 3401 3450 

PATENT. SEQ 3401 CCGACGCGTC GGATGGCTGT GTTTTGCCGT CCGAATGTGA TGGAAGCCTG 3450 

3460 3470 3480 3490 3500 

ORF3 3451 TCCATACTGA AAAAAAGTCT ATAAAGGAGA AATATGATGA GTCAACACTC 3500 

ORF2.SEQ 3451 3500 

ORF1 3451 3500 

PATENT. SEQ 3451 TCCATACTGA AAAAAAGTCT ATAAAGGAGA AATATGATGA GTCAACACTC 3500 

3510 3520 3530 3540 3550 

ORF3 3501 TGCCGGAGCA CGTTTCCGCC AAGCCGTGAA AGAATCGAAT CCGCTTGCCG 3550 

ORF2.SEQ 3501 3 550 

ORF1 3501 3550 

PATENT. SEQ 3 501 TGCCGGAGCA CGTTTCCGCC AAGCCGTGAA AGAATCGAAT CCGCTTGCCG 3550 

3560 3570 3580 3590 3600 

ORF3 3551 TCGCCGGTTG CGTCAATGCT TATTTTGCAC GATTGGCCAC CCAAAGCGGT 3600 

ORF2.SEQ 3551 360 0 

ORF1 3551 3600 

PATENT. SEQ 3551 TCGCCGGTTG CGTCAATGCT TATTTTGCAC GATTGGCCAC CCAAAGCGGT 3600 

3610 3620 3630 3640 3650 

ORF3 3 601 TTCAAAGCCA TCTATCTGTC TGGCGGCGGC GTGGCAGCCT GTTCTTGCGG 3650 

ORF2.SEQ 3601 3650 

ORF1 3601 3650 

PATENT. SEQ 3 601 TTCAAAGCCA TCTATCTGTC TGGCGGCGGC GTGGCAGCCT GTTCTTGCGG 3 650 

3660 3670 3680 3690 3700 

ORF3 3651 TATCCCTGAT TTGGGCATTA CCACAATGGA AGATGTGCTG ATCGACGCAC 3700 

ORF2 . SEQ 3651 3700 

ORF1 3651 3700 

PATENT. SEQ 3651 TATCCCTGAT TTGGGCATTA CCACAATGGA AGATGTGCTG ATCGACGCAC 3700 

3710 3720 3730 3740 3750 

ORF3 3701 GACGCATTAC GGACAACGTG GATNCGCCTC TGCTGGTGGA CATCGATGTG 3750 

ORF2.SEQ 3701 3750 

ORF1 3701 375O 

PATENT. SEQ 3701 GACGCATTAC GGACAACGTG GATNCGCCTC TGCTGGTGGA CATCGATGTG 3750 

3760 3770 3780 3790 3800 

ORF3 3751 GGTTGGGGCG GTGCATTCAA TATTGCCCGT ACCATTCGCA ACTTTGAACG 3800 

ORF2.SEQ 3751 3800 

ORF1 3751 3800 

PATENT. SEQ 3751 GGTTGGGGCG GTGCATTCAA TATTGCCCGT ACCATTCGCA ACTTTGAACG 3800 

3810 3820 3830 3840 3850 

ORF3 3801 CGCCGGTGTT GCAGCGGTTC ACATCGAAGA TCAGGTA 3850 

ORF2.SEQ 3801 3850 

ORF1 3801 3850 

PATENT. SEQ 3801 CGCCGGTGTT GCAGCGGTTC ACATCGAAGA TCAGGTAGCG CAAAAACGCT 3850 
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3860 3870 3880 3890 3900 

ORF3 3851 3900 

ORF2.SEQ 3851 3goo 

ORF1 3851 3goo 

PATENT. SEQ 3851 GCGGTCACCG TCCGAACAAA GCCATTGTTA TCTNAAGATG NAATGGTCGA 3 900 

3910 3920 3930 3940 3950 

ORF3 3901 3g50 

ORF2.SEQ 3901 3950 

ORF1 3901 3950 

PATENT. SEQ 3901 CCGTATCAAA GCTGCCGTAG ATGCGCGCGT TGNTGNGAAC TTCGTGATTA 3950 

3960 3970 3980 3990 4000 

ORF3 3951 4000 

ORF2.SEQ 3951 4000 

ORF1 3951 4000 

PATENT. SEQ 3951 TGGCGCGTAC CGATGCGCTG GCGGTAGAAG GTTTGGATGC CGCTATCGAA 4000 

4010 4020 4030 4040 4050 

ORF3 4001 — 4050 

ORF2.SEQ 4001 40 50 

ORF1 4001 4050 

PATENT. SEQ 4001 CGCGCCCAAG CTTGTGTCGA AAGCCGGTGC GGACATGATT TTCCCTGAAG 4050 

4060 4070 4080 4090 4100 

ORF3 4051 4100 

ORF2.SEQ 4051 4100 

ORF1 4051 4100 

PATENT . SEQ 4051 CCATGACCGA TTTGAACATG TACCGCCAAT TTGCAGATGC GGTGAAAGTG 4100 

4110 4120 4130 4140 4150 

ORF3 4101 4150 

ORF2.SEQ 4101 4150 

ORF1 4101 4150 

PATENT. SEQ 4101 CGTGTTGGCG AACATTACCG AGTTTGGTTC CACTCCGCTT TATACCCAAA 4150 

4160 4170 4180 4190 4200 

ORF3 4151 4200 

ORF2.SEQ 4151 4200 

ORF1 4151 4200 

PATENT. SEQ 4151 GCGAGCTGGC TGAAAACGGC GTGTCGCTGG TGCTGTATCC GCTGTCATCG 4200 

4210 4220 4230 4240 4250 

ORF3 4201 4250 

ORF2.SEQ 4201 4250 

ORF1 4201 4250 

PATENT . SEQ 4201 TTCCGTGCAG CAAGCAAAGC CGCTCTGAAT GTTTACGAAG CGATTATGCG 4250 

4260 4270 4280 4290 4300 

ORF3 4251 4300 

ORF2.SEQ 4251 4300 

ORF1 4251 4300 

PATENT. SEQ 4251 CGATGGCACT CAGGCGGCGG TGGTGGACAG TATGCAAACC CGTGCCGAGC 4300 
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4310 4320 4330 4340 4350 

ORF3 4301 4350 

ORF2.SEQ 4301 4350 

ORF1 4301 4350 

PATENT . SEQ 4301 TGTACGAGCA TCTGAACTAT CATGCCTTCG AGCAAAAACT GGATAAATTG 43 50 

4360 4370 4380 4390 4400 

ORF3 4351 4400 

ORF2.SEQ 4351 4400 

ORF1 4351 — 4400 

PATENT. SEQ 43 51 TTTCAAAAAT GATTTACCGC TTTCAGACGG TCTTTCAACA AATCCGCATC 4400 

4410 4420 4430 4440 4450 

ORF3 4401 4450 

ORF2.SEQ 4401 4450 

ORF1 4401 4450 

PATENT. SEQ 4401 GGTCGTCTGA AAACCCGAAA CCCATAAAAA CACAAAGGAG AAATACCATG 4450 

4460 4470 4480 4490 4500 

ORF3 4451 4 5 00 

ORF2.SEQ 4451 4500 

ORF1 4451 4500 

PATENT. SEQ 4451 ACTGAAACTA CTCAAACCCC GACCTTCAAA CCTAAGAAAT CCGTTGCGCT 4500 

4510 4520 4530 4540 4550 

ORF3 4501 455 0 

ORF2.SEQ 4501 4550 

ORF1 4501 4550 

PATENT. SEQ 4501 TTCAGGCGTT GCGGCCGGTA ATACCGCTTT GTGTACCGTT GGCCGCACCC 4550 

4560 4570 4580 4590 4600 

ORF3 4551 4600 

ORF2.SEQ 4551 4 600 

ORF1 4551 4600 

PATENT. SEQ 4551 GGCAACGATT TGGAGCTATC GCGGTTACGA CATC TTGGAT TTGGGCACAA 4600 

4610 4620 4630 4640 4650 

ORF3 4601 46 50 

ORF2.SEQ 4601 4650 

ORF1 4601 4650 

PATENT. SEQ 4601 AAATGCGTTT GAAGAAGTAG CCCACCTGCT GATTCACGGT CATCTGCCCA 4650 

4660 4670 4680 4690 4700 

ORF3 4651 4700 

ORF2.SEQ 4651 4700 

ORF1 4651 4700 

PATENT. SEQ 4651 ACAAATTCGA CGTGGAAGCT TATAAAAGGA AGCTCAAATC CATGCGCGGC 4700 

47.10 4720 4730 4740 4750 

ORF3 4701 475O 

ORF2.SEQ 4701 4750 

ORF1 4701 4750 

PATENT. SEQ 47 01 CTGCCTATCC GTGTATTAAA GTTTTGGGAA AGCCTGCCTG CACATACCCA 4750 

4760 4770 4780 4790 4800 

ORF3 4751 4800 

ORF2.SEQ 4751 4800 

ORF1 4751 4800 

PATENT. SEQ 47 51 TCCGGATGGA CGGTAATGGC GTACCGGCGG TATCCATGCT GGGCTGCGTT 4800 
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4810 4820 4830 4840 4850 

ORF3 4801 4850 

ORF2.SEQ 4801 4850 

ORF1 4801 4 8 5o 

PATENT. SEQ 4801 CATCCCGAAC GTGAAAGCCA TCCCGGAAAG TGAAGCGCGC GACATCGCCG 4850 

4860 4870 4880 4890 4900 

ORF3 4851 4900 

ORF2 . SEQ 4851 4900 

ORF1 4851 4900 

PATENT. SEQ 4851 ACAAACTGAT TGCAGCCTCG GAGCCTCCTG CTGTACTNGG TATCAATATC 4900 

4910 4920 4930 4940 4950 

ORF3 4901 4950 

ORF2.SEQ 4901 4950 

ORF1 4901 4950 

PATENT. SEQ 4901 GCACAACGGC AAACGCATTG AGTTGAAGCG ACGAGAGACA TCGGCGGTCA 4950 

4960 4970 4980 4990 5000 

ORF3 4951 5000 

ORF2.SEQ 4951 5000 

ORF1 4951 5000 

PATENT. SEQ 4951 TTTCCTGCAA CTGTTNCACG GCAACGCCCA AGCGATCACA CATCAAAGCC 5000 

5010 5020 5030 5040 5050 

ORF3 5001 5050 

ORF2.SEQ 5001 5050 

ORF1 5001 5050 

PATENT. SEQ 5001 ATGCACGTTT CACTGATTCT GTATGCGAAC ACGAGTTCAA CGTTCTACCT 5050 

5060 5070 5080 5090 5100 

ORF3 5051 5100 

ORF2.SEQ 5051 5100 

ORF1 5051 5100 

PATENT. SEQ 5051 TTACCGTTTG CCGTTCTTCT GGTCGGTTCT AGCCCTGTAA AAAGAGAAGG 5100 

5110 5120 5130 5140 5150 

ORF3 5101 5150 

ORF2.SEQ 5101 5150 

ORF1 5101 5150 

PATENT. SEQ 5101 TTGTTAGCTG GCGAAGGTTT GCAGCCGTTA CAGTTTCCCG CGTTATAGCG 5150 

5160 5170 5180 5190 5200 

ORF3 5151 5200 

ORF2.SEQ 5151 5200 

ORF1 5151 5200 

PATENT. SEQ 5151 GCCAAGAAAC GAGTTTGGCG CACGGTGAGA ATTACCTGTT GCAACGCCCC 5200 

5210 5220 5230 5240 5250 

ORF3 5201 5250 

ORF2.SEQ 5201 5250 

ORF1 5201 5250 

PATENT. SEQ 5201 AGCCTTTACC ATATGTGGGC CTACTGGCTT NGGCTAGTGC TAAGAAACGC 5250 
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5260 5270 5280 5290 5300 

ORF3 5251 5300 

ORF2.SEQ 5251 530 o 

ORF1 5251 5300 

PATENT. SEQ 5251 GGCTATGCTA GCGCCTACAT GCCGAGTGAC GAGCGTNACG CCATCGCAAA 5300 

5310 5320 5330 5340 5350 

ORF3 5301 5350 

ORF2.SEQ 5301 5350 

ORF1 5301 5350 

PATENT. SEQ 5301 ACTTATACGC ATTTCGGGAA GCCAANCGCT GGCGGCACAA AGCCTGGATA 5350 

5360 5370 5380 5390 5400 

ORF3 5351 5400 

ORF2.SEQ 5351 5400 

ORF1 5351 54 00 

PATENT. SEQ 5351 GTTGTGCGGC TAACGNGGCC ATTACGACCT CATGTATAGT CCTCTGACAT 5400 

5410 5420 5430 5440 5450 

ORF3 5401 5450 

ORF2.SEQ 5401 54 50 

ORF1 5401 545O 

PATENT. SEQ 54 01 GGCGCTANTT GCGCCC 5450 
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